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OTATUS DETERMINATION 
Bacl^oimd of the Invention 

The present invention relates to a method and apparatus for detennimng the status of a 
5 subject^ and in particular for determining the ability of a subject such as a horee to compete 
in a sporting and/or racing event. 

Description of tbe Prior Art 

The reference to any prior art in this spedfication is not, and should not be taken as» an 
10 acknowledgement or any form of suggestion that the prior art forms part of the common 
general knowledge in Australia. 

A condition of a performance animal, for example a racehorse, may typically be 
determined by conventional means such as a blood profile test and clinical appraisal. 
15 However, these tests are of limited value because a correlation between results of a blood 
profile test or cUnical appraisal and a condition or state of a performance animal is 
minimal, 

A blood profile test may be suitable for providii^ some information in relation to an 
20 animal that is clinically diseased or iD, but is rarely suitable for determining fitness to 
perform of an animal, particularly if the animal is healthy according to use of current 
clinical appraisal methods, and particularly if the animal caimot communicate information 
about its condition. Although blood profile tests are relatively inexpensive and easy to 
perform, they do not provide assessment of a wide range of conditions, correlations 
25 between test results and conditions of performance animals are poor, are limited to 
assessment of a few diseases, and are sometimes only useful in assessment of advanced 
stages of disease \^*ere clinical intervention is too late to prevent significant loss of 
performance. 



30 Alternative diagnosis or assessment procedures are often complex, invasive, inconvenient, 
expensive, time consuming, may ^ose an animal to risk of injury from the procedure. 
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and often require tianspoit of the animal to a diagnostic centre, 

A final leport of the results of a blood test to an end user, eg. a trainer, often requires 
involvement of multiple parties each providing separate input to the report. For example, a 
. S Veterinarian may collect a blood sample, fbc sample is transported or sent to a laboiatofy 
for analysis, personnel in the laboratory perform an analysis using machinery on the blood 
sample, automated results from the analysis, with or without a veterinary pathologist 
inteipretation, are returned to the veterinarian who then interprets the results and provides 
a separate lepon to the trainer. The process is laborious, time consuming, subject to error 
10 and interpretation bias and may or may not contain information relevant to the end user. 

Bioinformatics may be used with genetic based diagnosis of an animal^s health. 

Currently, it is known to use genetic information in determining information regarding an 
15 individual. This can be achieved in a number of ways depending on the information that is 
desired. 

Thus, for example, WO 01/25473 describes a method of characterising a biological 
condition or agent using calibrated gene expression profiles. In this case, when a subject is 
20 suspected of having a condition, a test is performed to obtain a specific profile, which is 
then analysed. In particular, the collected projSle is compared to a predetermined profile to 
determine if the condition has been correctly identified. However, this sutfers from 
drawbadcs in that a preliminary diagnosis is required to allow the correct test to be 
performed. 

25 

US 6,287,254 describes a system that allows users to perform DNA genetic profiling to 
determine the susceptibili^ of a subject to a condition. In particular, in this example, the 
subject is profiled to determine the presrace of predetermined genes, which in turn indicate 
the susceptibility of a subject to a respective condition. Again, this requires specific tests 
30 for specific conditions, and only allows the suscq>tibility of a subject to be determined. 
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Sammaiy of the Present Inventton 

In a first broad form the piesrat invention provides a method of determining the status of a 
subject^ the method including: 

a) Obtaining subjea data, the subject data including one or more parameter values, at 
5 least one of the parameters being indicative of the cunent biological status of the 

subject; 

b) Cofrq)aring the subject data to predetermined data, the predetermined data 
ixiduding for each of a number of individuals: 

i) One or more values for the parametm for the respective individual, at least 
10 some of the individuals having a number of condidons relevant to the status of 

the individual, the number of parameters being statistically suffident to 
distinguish each of the conditions; and, 

ii) An indication of the status of the respective individual; and, 

c) Detcmrining the status of the subject in accordance with the results of the 
15 comparison. 

The number of parameters is typically greater than about 100, and preferably between 
about 1000 and about 6000. As used herein, the term ^'abour refers to values (eg., 
amounts, concentration^ time etc) that vary by as much as 30%, preferably by as much as 
20 20%, more preferably by as much as 10%, even more preferably by as much as 5%, and 
still even more preferably by as much as 1% to a specified or reference value. 

The method typically includes determining any conditions displayed by the usen 

25 The method can also include determining the ability of the subject to perform in a sporting 
and/or racing event in accordance with any determined conditions. 

The method of perfonnir^ the comparison usually includes: 

a) Obtaining a set of templates, the set of templates representing differences between 
30 groups of individuals; and, 

b) Using the templates to classify the subject data into a respective one of the groups. 
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The method may further include detenniiung one or more conditions displayed by the 
subject in accordance with the determined group. 

5 

The parameters can be representative of the level or abundance of an agent In the subject 
or in a biological sample obtained horn the subject. The agent may be selected from one or 
more of: • - 

a) A nucleic acid molecule; 
10 b) A proteinaceous molecule; 

c) A carbohydrate; 

d) A lipid; 

e) A drug; 

Q A chemical; 
15 g) Aga^ 

h) A cell; 

i) A pathogenic organism; and» 
j) A non pathogenic ozganism. 

20 By "obtained froiri* is meant that a sample such as» for example, a nucltic add extract or 
polypeptide extract is isolated from, or derived from, a particular source. For example, the 
extract may be isolated directly from a tissue or a biological fhiid isolated directly from the 
subject 

25 Other parameters can be measured however, such as the near IR of the subject's blood, 
general measurements^ such as temperature, or other biological indicators. 

The method usually includes: 

a) Receiving confirmation of the determined status; and^ 
30 b) Updating the predetermined data in accordance with the confirmed status and the 
subject data. 
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The predetermined data can include phcnotypic information of the individuals^ and the 
subject data can include phcnotypic information regarding the subject, the phenotypic 
information including details of one or more phenotypic traits. 

5 

In this case^ the method can include comparing the subject data to predetermined data for 
individuals having one or more phenotypic traits in common with the sutyject. 

In a second broad form the present invention provides apparatus for determining the status 
10 of a subject, the apparatus induding a processing system adapted to: 

a) Obtain subject data, the subject data induding one or more parameter values, at 
least one of the parameters being indicative of the current biological status of the 
sulyject; 

b) Compare the subject data to predetermined data, the predeteimined data including 
15 for each of a number of individuals: 

i) One or more values fbr the parameters for the respective individual, at least 
some of the individuals having a number of conditions relevant to the status of 
the individual^ the number of parameters being statistically sufficient to 
distinguish each of the conditions; and, 
20 U) An indication of the status of the respective individual; and, 

c) Determine the status of the subject in accordance with the results of the 
comparison* 



The apparatus can therefore be adapted to perform the method of the first broad torn of 
25 the invention. 

In a third broad form the present invention provides a computer program product for 
determining the status of a subject, the computer program product including computer 
executable code which when executed on a suitable processing system causes the 
30 processing system to perform the method of the first broad form of the invention. 
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In a fourth broad form the present invention provides a method of allowing a user to 
determine the status of a subject, the method including: 

a) Receiving subject data from the user via a commum*cations network* the subject 
data indudiog one or more parameter values, at least one of the parameter being 

S indicative of the current biological status of the subject; 

b) Causing the base station to: 

i) Compare the subject data to predetermined data, the predetermined data 
including for each of a number of individuals: 

(1) One or more parameter values for the respective individual; and, 
10 (2) An indication of the status of each individual; and, 

ii) Determine the status of the subject in accordance with the results of the 
comparison; and, 

c) Ttansferring an indication of the status of the subject to the user via the 
communications network. 

IS 

The method generally includes: 

a) Having the user determine the subject data using a remote end station; and, . 

b) Transferring the subject data from the end station to the base station via the 
communications network. 

20 

The base station can include first and second processing systems, in which case the method 
can include: 

a) Transferring the subject data to the first processing system; 

b) Transferring the subject data to the second processing system; and, 
25 c) Causii^ the second procesdng system to perform the comparison. 

The method may also include: 

a) Transferring the results of the comparison to the first processing system; and, 

b) Causing the first processing system to determine the status of the subject 

30 

In this case, the method preferably includes at least one of: 
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Transferring the subject data betwcCT the Gommunicatioiis network and the first 
processtng system through a first firewall; and, 

Transfctrmg the subject data between Che first and the second processing systems 
through a second firewall. 

5 

The second processing syslem may be coupled to a database adapted to store the 
predetermined data, the method including: 

a) Querying the datat>ase to obtain at least selected predetermined data from the 
database; and» 

10 b) Compare the selected predetermined data to the subject data. 

The second processing system can be coupled to a subject database, the method including 
storiAg the subject data in the subject database. 

It is also possible to implement any one of the features of the first broad form of the 
invention. Thus, for example* the status may include details of any conditions of the 
individuals, in which case the method can include determining any conditions displayed by 
the user. The method may also include detennining the ability of the subject to perform in 
a sporting and/or racing event in accordance with any determined conditions* 

Similarly, the method of performing the comparison typically includes causing the second 
processing system to: 

a) Obtain a set of templates, the set of templates representing differences between 

groups of individuals; and, 
b> Use the templates to classify the subject data into a respective one of the groups. 

The method can include having the user determine the subject data using a secure array, 
the secure array having a number of features each located at respective position on the 
anay, and a respective serial number. In this casc^ the method typically includes causing 
3 0 the base station to: 

a) Determine the serial number from the subject data; 



a) 
b) 



15 



20 



25 
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b) Dctemiiiie a layout indicating the position of each feature on the array; 

c) Deterznining the parameter values in accordance with the determined layout, and 
the subject data. 



5 The method may also include: 

a) Receiving confirmation of the determined status from the user; and, 

b) Updadng the predetermined data in accordance with the confirmed status and the 
subject data. 

10 In this case, the features can include at least one of: 

a) An ohgonucleotide; 

b) A peptide; 

c) An antibody; 

d) A carbohydrate; 
15 e) A lipid; 

f) A cell; and, 

g) An organism. 



20 



The method can also include causing the base station to: 

a) Determine payment information, the payment information representing the 
provision of payment by the user; and, 

b) Perform the comparison in response to the determination of the payment 
information. 



25 In a fifth broad form the present invention provides a base station for determining the 
status of a subject, the base station including: 

a) A store method for storing predetermined data, the predetermined data including 
for each of a number of individuals: 

i) One or more parameter values, at least one of the parameters being indicative of 
30 the current biological status of the individual; 

ii) An indication of the status of the individual; and. 
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a) Obtain a set of icmplatcs, the set of templates representing differences between 

groups of individuals; and, 

b) Usethetemplatestoclassifythesul^cctdataintoaicspecUveoQeofthegroups. 

5 The method may further indude determining one or more conditions displayed by the 
subject in accordance with the determined group. 

The subject data may be determined using a secure array, the secure array having a number 
10 of features cadi located at respective position on the array, and a respeaive serial number, 
the processing system being adapted to: 

a) Dctwminc the serial number from the subject data; 

b) Determine a layout indicating the position of each feature on the array; 

c> Determining the parameter values in accordance with the determined layout, and 
15 the subject data. 

The processing system can be adapted to: 

a) Receive confirmation of the determined ability; and, 

b) Update the pt«determined data in accordance with the detennined ability and the 
20 subjea data. 

The base station of the fifth broad form of the.invention may therefore be ad^ted to 
perform the method of the fourth broad form of the invention. 

25 In a sixth broad form the present invention provides a computer program product for 
determining the status of a subject, the computer program product induding computer 
executable code which when executed on a suitable processing system causes the 
processing system to perform the method of the fourth broad form of the invention. 



30 



In a seventh broad form the present invention provides an end station adapted to 
the status of a subject, the end station including a processor adapted to: 
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. . *, ^ A* ..«:^ via a coromuiiications network, the subject 
Determine subject data ftom the user v»a a coromuui 
8) weierm «,™„^ values, at least one of the parameter being 

data including one or more parameter values, 
iodicativeofthecunenibiologicalstamsoftbesubject; 

jnaicauvcw* a communications network, the 

b) Transfer the subject matter to a base station via a commumca 

base station being adapted to: 
predetermined data including: 

(1) one or more parameter values forthe respective individual; and, 

(2) Anindicationofthestatuaofeachindividual;and, 

* »,Mfict in accordance with the results of the 
ii) Determine the status of the subject m aocoiu»«^ 



10 
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30 including tor each ofa number of individuals: 

i) Oneormoreparametervaluesfortherespectiveindividualjand. 
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Thc processing system is generally adapted to perform the method of the ninth broad form 
of the invention. 

In an eleventh broad fonn the present invention provides a computer program product for 
5 determining the ability of a subject to perform in a sporting and/or racing event, the 
computer program product induding computer executable code which when executed on a 
suitable processing system causes the processing system to perform the method of the 
ninth broad fonn of the invention. 



10 In a twelfth broad form the present invention provides a method of providing secure arrays 
for use, each array including a number of predetermined features, the method including: 

a) Determining a number of resfpective feature layouts, each layout representing the 
positiora'ng of each feature on a reactive array; 

b) Determining a number of serial numbers, each serial number corresponding to a 
15 respective layout; 

c) Generating a number of arrays, each anay being generated in accordance with a 
respective layout, and including the corresponding serial number thereon, the serial 
number being used in processing the array. 

20 The method can be performed to provide the arrays on behalf of an entity, the method 
including providing an indication of the layouts and corresponding serial numbers to the 
entity, to thereby allow the entity to process the arrays. 

The method of dctcrmixung the layouts typically includes: 
25 a) Determining a preferred layout; and» 

b) Moving the position of one or more of the features from the position in the 
preferred layout to alternative position. 

The method can include: 
30 a) Determining the type of each feature; and, 

b) Exchanging the position of one or more features having different feature types. 
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Brief Description of the Drawings 

An example of the present invention will now be described with reference lo the 
accompanying drawings^ in which: - 

Figure 1 is a schematic diagram of an example of a processing system for implementing 
the invention; 

Pigure 2 is a flow chart outlining the process implemented by the system of Figure 1; 
Figure 3 is a schematic diagram of an example of a distributed architecture for 
implementing the invention; 

Figure 4 is a schematic diagram of an example of one of tlie end stations of Figure 3; 
Figures 5A and 5B arc a flow chart of the process implemented by the system of Figure 3; 
Figure 6 is a flow chart of the process fox generating templates; 

Figures 7A and 7B are a flow chart of the process of comparing the subject data to the 
templates; 

Figure 8 is a schematic diagram of a second example of a distributed architecture for 
implementing the invention; 

Figure 9 is a flow chart of the process for generaUng secure arrays; and. 

Figure 10 is a flow chart of the process for generating subject daU using the secure anays. 

Figure 11 is a flow diart of the process of data mining; 

Figure 12 is a flow diagram illustrating dataflow steps in a spcdflc example a$ part of a 
computer system capable of delivery of remote diagnostic services; 

Figure 13 is a flow diagram showing steps for diagnosing a condition of an am'mal in 
accordance with a specific example; 

Figure 14 is a diagram illustrating an environment for worldx^ the specific example shown 
in Hgure 13; 

Figure 15 is a flow diagram illustraung stqw for preparing an array in accordance with a 
specific example of the invention; 

Figure 16 is a flow diagram showing steps for determining a nucleic add expression level 
in a biological sample; and 

Figure 17 is a flow diagram fliusirating steps for building a database in accordance with a 
speciric example. 
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Detailed Descriptioa of flie Pvefeixed Embodiments 

An example of the present invention will now be descra>od with reference to Figure 1, 
which shows a processing system suitable for implementing the present inventiott. 

In particular, Hgure 1 shows a processing system 10 including a processor 20, a memory 
21, an optional input/output (I/O) device 22 and an inteHace 23 coupled together via a bus 
24. In use, the interface 23 is adapted to couple the processing system 10 to one or more 
databases shown generally at 11. 



10 



In use. the processing system 10 is adapted to receive subiect data, which is data 
representative of the current biological status of a subjecL The subject data is typically in 
the form of raw data and therefore requires interpretation to allow the status of the subject 
to be determined. This is achieved by having die processing system 10 compare the 
15 subject data to predetermined data stored in the database 11. The predetermined data 
includes data representative of the biological status of a number of individuals, together 
with an indication of the actual status of the individuals when the predetermined data was 
collected. 

20 Accordingly, by comparison of the subject data with the predetermined data, this allows 
the subject data to be interpreted and the cuiT<»t biological status of the subject to be 
determined. 

The manner in which this may be achieved will now be described in outline with respect to 
25 Figure 2. 



In parttcular, ai step 100 the user determines subject data in the form of parameter values 
representing the current biological status of the subject. In particuhir, die parameter values 
represent specific nieasurcments of selected parameters that represent the current 
biological sutus of the subject. It will be appredated that a number of dfiBferem fomw of 
panuneters may be used, as will be described in more detafl below. 
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I. «1I be ,pp,«i.,«i *„ d.«™in.«on of A. p«fc,.rmtoed data is typify , 

which IS shown in Figure 3. or 
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30 within a company or the like «• ^-rnvs 



U-n-02j22 :52 ;OAVIES COLLISON CAVE Pat. ^TrBd ;61 7 3368 2262 # 22/110 

A- 



18 



Whilst this technique describes transferring the data electronically via the communications 
netwoite. it Will also be possible to transfer data via alternative techniques such as 
iransfening data in a hard, or printed format, as well as transferring the data electronically 
in a physical mediun, such as a floppy disk, CD-ROM or the like. Wii^ess transfer or the 
5 l*cisalsopossible,aswillbeappreciatedbythcpersonskilledintheart. 

In any event, it wiU be appreciated that in this exaniple, the services provided by the base 
stataon 1 a.^ ge«.raUy accessible via the Internet 2. Tie processing system 10 is therefore 
generally capable of generatir^ web pages, or the like, that can be viewed by users of the 
10 end stations 3. Accordingly, the processing system 10 may be any suitable form of 
processing system that executes appropriate application software stored in the memory 21 
to allow .he desired fimctionality to be achieved. TypicaUy however the base station 1 
includes a processing system, such as a network server, web server or the Jike. 

15 Sin^ilarly. the end stations 3 must be capable of communicating with the base station 1 to 
aUow browsing of web pages, or the transfer of data in other mamiers. Accordingly as 
Shown inHgure 4, in this example the end stationssai. formed ftomapm^^ 
includmg a processor 30. a memory 31. an input/output (I/O) device 32 and an interfiu* 33 
coupledtogetherviaabus34. 11.0 interlace 33. which may be a network interface card or 

20 the hke IS used to couple the end station to the Internet 2 or one of the respective I^Ns 4, 

It will therefore be appreciated that the end station 3 may be formed ftom any suitable 
procerus system such as a suitably programmed PQ totemet T««ninal, Up-top, hand 
held PC or the like which is typically opiating application software to enable web 
5 browsmg or the like. Alu^atively, the end station 3 may be fbrmed ftom specialised 
hardware, such as an electromc to^ sensitive screen coupled to suitable processor and 
memo^. In addition to this, the e«i stations 3 may bo comiected to the Internet 2 or the 
LANs 4 via wired or wireless commons, as will be appredated by a person skiUed in the 



30 



Operation of the system to determine the status of the subject wiU now be described in 
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more detail with feference to Figures 5A and SB. 



la particular, as set out in Figiire 5A the nm«.ee - 

A^, ■ - process begjDs at step 200 with the mur 

deiermimng the parameter values for the «,w«<r -n, w wim tiie u<5er 

. vwucs ror tne subject. The parameter values are then <»n,v«**j 

<anbeii.teii>rM6dbytliebaKaati<>nl. «■ raraat mat 

* IV wiarge inc user for the senrjoes provided. 

The user nau,e and password will then be gene«ted and «,b«q«ently verified in ,h 
normal way. Alternatively, identification of the user can bo acZ^edT m 
cookies stored at the end station 3 or an Identf If « ^"^^ 
2s * •*''*""*'*°*'fi"»socaated with the end station 3 wh.>h 
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accessing re^Ve web p^cs generated by the base station 1 Thi. ,t, . 
select the respective ^rviccs leguircd, which in thi. Z T " "''^ 

of asubjcct "^P^^ ^ ^''««<=^««>» Of the status 

Once the user has been authorised, the user wiU be tran*fr 

anow the subiect data to be t.„s«. Jl^e tal"? ^ 
typical,yachievedbyi«p,e.e^,^S3 ^^4^7^^^^^^ ^ ' ™^ 
base station 1 and the end station 3 Ti, T ''"'^ "^"""^^^ '^^^^ 

ensu. .at the subieTrr^TbTr^^^^^^ ^"^'^'^ ^ ' - 

JO retained confidential. ^ ^ **«tion 3 is 



or:rre::-::rr^^^^ 

15 is therefore necessary to ensure the i„ft. 7 ^ ^"^^^^ 

-cessary to ensure the uif6m,ation is retained confidential at aU times. 

After accessing the base station! at st«, 220 th« «.m ^ ^ . 

station 1 at step 230. At this point. ih^T ^ "^"'"^ 

K -^w- w mis poant, ihc base station 1 wiir ti/«v<^n 

* Will oc crcscribed in fiirther detail bei^i^r u 

wrays, in wbich case a number of 
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«.P270. " ""''^ -f^pUyC by 0« subject 

15 

station 3 at step 290. condihons to the end 

20 

well « . . one or m<« ^ 

«^ KiVA (snoRNAX smaJl <^tbplasinic RNA 
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(scRNA), ribosonwi rna (rflNA), ttamlatfonal control RNA ^tcRNTA. ^ . 

Interference RNA (iRNA) and mitochondrial RNA rLNA^ .^^^^ "«<^A) or 
a., contemplates by the present invention ind^ ""^^ 
5 hormones, cytokines. chemoKn^ ^ '^^"^ immunoglobulin^, 

Allexnativc,rrr ^ -"-Peptides, adhesins. glycoprotdns and the like 

/vueniaiivciy, the expression data mav tcImp fh* i-. • ^ 

mclnding monosaccharides k T °^ * carbohydrate 

8 onosacchandes, oligosaccharides and polysaccharides. 

necessarily visible. wentified before symptoms are 



^^^^ ^ ^ , zjTj::::rJr^z 
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respiratory illness, or the like, which can be difficult ^ r 

HOW.W, i. will b. <,,p,ecMttd lh« tfU, is .too ^. 

e <Mccay, but nay ta« „ <„ rtu= tWelc's p«*,ra,a.a. 
10 keep tlM conditfan secnt. (or » , • . "^""e 

a. .-dividual from , LZ^T^ «^«=.n.pl=. i(0» cl»„ce of 

»,«,«o„is™ . J.^.c^ru:rrrr:'"■r'""•'^'" 
Furthermore, the more data available from individuals «.,fF • ^ 
types of conditions. aoiviauajs suffcnng from different 

30 

The number of parameters leqniied win dcneiui *k- ^ 

^ ^ depend on the number of conditions to be 
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dislii^oished. In particular, it wfll depend on fectore such as: 

• The presence and detection of unknown conditions: 

• TJicrangcofconditionstobeidentificdwithinthepopulation; 

• Hie levels of inddence of each condition in the population; 

• The ability 10 distinguish between the conditions. 

It will be appreciated that as individuals, including performance animals such as race 
horses, can suffer from a wide variety of conditions. Aen it is preferable for a large number 
of pammeten. such as 3.000 to 5,000 to be used. However, this number can be 
significantly lower if only a minor m„nber of condiUons are to be identified. Thus for 
^cample, the number of pamzneters used may be anywhere from 10 up to 10.000. or more 
Suitably, the number of parameter, employed are at least about 20. preferably at least 
about 50. more preferably at least about 100. even more preferably at least about 1 50, even 
more preferably at least about 200. even more prefcrably at least about 300, even more 
15 prefenib^ at least about 50^ even more preferably at least about 1000. even more 
preferably at least about 1500. even more preferably at leasr about 2000. even more 
preferably at least about 4000. even more preferably at least about 6000, even more 
preferably at least about 8000. and stiU even more preferably at least about 10000. 

In addition to this, the effect of a condition on an individual may also vary in accordance 
With additional phenotypic information relating to a particular characteristic or set of 
charactenstics of the subjec, as determined by interaction of the subject's genotype with 
the env,«,nment in which it exists. In this embocfiment. such -chaxacterisric data' may be 
selected fiom age, sex. height, fepgth, weight, ethnicity, race, breed of animal, feeding 
patterns exerdse patterns, medication supplied, nutritional or growth supplements 
supphed, nutritional analysis, hair colour. sVin colour, eye colour, body composition, l.t 
compositBon. wafer retention, obesity,transcriptomicprome.proteo^^ 
Ussue type, endocrine function. immunoK^cal fimction including ceUuIar and humorai 
unmune fimction, tolcmnce. allergy, transplant refection, cancer, hyperplasia, 
gastromtestmal fimction, neurological function. Iddney fi^ncuon, heart function, brain 
function, pan^eatic function, bone fimction. Joint function, sexual or reproductive 



20 
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tocfion. ™«.bclic load u»,i^o^ profile, s»b«a.K» ^ i^^, 

iobom ,£ n^fcoBs™. infection i„oludtog w^oo. 

^ "f«<»n. «j«b««rt™ i.^. 

toue rec<.i«™:tf»,^ «l a» Iflc Tl^ ph«,o,ypic i„f„„„aUon may ^ 

««*«^n.d d«. Aoo^y. «„. p^p,«^, ^ prcd.«rn,incd dau is 
g«.»% p.16™,d. 1«. p„«ss g«^^ known data ™.„g and win b. 
*»=ri^>»"-o»d«dlbcIow««i„in„owb.o«Uncdwid.rrftoe,».tt,Hg^ 

In P«iad« « «cp 300 *e opa.«« o, b« , ,p.„^ „ 

P^^c^ed da... inctading g««ypi. ^ 

20 ^*«»^^ft»0.b..»nn»dtt«a.ep.«.,^^^,^.^^ 
^^«apood » p„_ ^..^^ ^ ^ 
" n«o, of «.p««™ .,,p^ p^^ ____ _^ ^ ^ ^ 

23 """lyinKapretedbylliebmeslMioiij. 

^ s«P 310 fl,c p«««i„g ay^ „ „ 

30 s^*„ ,0 u,en u. ^ ^ ^ « 
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■^racnaractoise the groups of individuals idMtffiedTnth^ ^ 
and to Classic i«iivid„a,s ioto these groups. « 

multip]idty of ways of A* - u appreciated that there is a 

efficteot 

It Will be appieciated that the definition of g^ups may be don. f 

in«pective of the phcnotypic tnrits. Tha^ T ""^^^ '"^^'^"^ 

condition will be contained in theZc' ^^'"^ ^^^"^ ^« 

ned ui the sa^c 8„,„p „,espective of each individual^ phenotypic 

However, if diffewnt phenotypic tvoes h»v. 

"~ ""^W ■ n^aaory coudilion, with . 
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to panfcular. u Step 400, the iimt tfaenmnes ™r.»^ 

described above. Processing system 10 as 

if the subject is a male hL .7 . ^ " '^''^ appreciated that 

^ « J,* "P^^^*™**^ dorses havii^ a rcspiratoiy disease. 

However,ifatcinpIatec<)iTespondstoaim>UDofinrf™M i v • 

«> a group of individuals haviag a range of pheootypic 
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s»« -n^ ^ ""^ '■^f"' " N dimensioMl vector 

predeiermincddata, ^" 
It is apparent to those skilled in the »t th^t #k-». • . - 

^ condition. suoject Das the respective 

If Ihe parameter vector is approximately equidistant 
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|an.n««»oclorisno.«n,.»y,rf,h,otterten,,^ P «. f *e sOiject 



A no^ 0, .iten,^ e„ .•mpI«B«,«d to Ih. p™«a„ i„v«tti». 

Multiple Firewall 
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» ««»« to U» d«, o» be ,«.l«d « cortid.«w by a,c «,«,.„, Of U,c 

ThU m n.™ now. ft. op^wr of U» l»« sado. t » co^^e .o p™Wde indicadoo. of 

l»«mcB Of proWdfag „ i«Hcatfo« of a» . f« is pro«cted. 

««« Of d» .„d ««on, 3 a.«n.p. „ a, ^ 

proc«s.„g 10 ,, dowBk»d d«.. «d, » fte .c„,p,^ fto„ a,. „ 

In <»d.r » ovcrcoin. Uu^ tte «do„ 1 i.,,^™^ 

in Ihi. ««np,.. a. 12. 14 wiU 1» «A«„«ally si,nto ,o U>. 

p»cc«„,sy«.„,Od«c««,*.v^.«,^,„„a^^^^.__^^^ 

P^v^n Of ,«ol.^ i. .ohieved ^ p»c«i„g syae. ,i .h. ^ 
r.«.v.»g of d,«, o, oU» «,„«^ U-o ^ ...^ J, 

^ d«e™i.d d. p^„, ^^u^^,^ . ^ 7 - 

transfcircd to the processing system 14. 



20 



30 



Thu^ the processing systen, 12 can receive the subject data, and operate to extract the 
paiameter values thcreftom. TTxc prooessin« svst«m i9 T 

vector or ri,. in, u • V'^'^S system 12 then generates the parameter 
COmDanson with rh#> J vr>~MUMii 



coraparison with the predetermmed data. 
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Once the comparison has been performed, the processing system 14 can determine those 
conditions suffered by the subject and then transfer an indication of this back to the 
processing system 12 through the firewaU 15. The processing system 12 can then transfer 
5 an indicadon of this indication to the end station 3. 

It wDl be appreciated that in this example even assuming the user is able to infiltrate the 
first firewall 13, the user will only be able to access previously submitted requests and the 
results determined therefrom. The prcs^ce of the dual firewall system therefore makes it 
10 virtually impossible for the user to infiltrate the processing system 14 and obtain access to 
the data stored in the database 1 1* 

In the remainder of the description, it will be appreciated that the processing systems 10; 
12, 14 are effectively interchangeable. 

15 

Parameter Ranges 

A further alternative to the present invention is for the comparison to be performed on the 
basis of parameter ranges defined for different conditions. 

20 Thus for example, each condition may have associated therewith a sequence of parameter 
value ranges determined based on ranges of parameter values for individuals diagnosed 
with the respective condition. The parameter value ranges can then indicate for a 
respective condition the parameter values that can be expected, allowing the determined 
parameter values to be compared to the respective ran^ fbr each condition to determine if 

25 the parameter values provided fall within a tcspcdtive range. 

Thus, a respective parameter range can be determined for each condition, with the 
parameter values determined for a subject being compared to each lange^ to determine 
those ranges within which the subject data falls. 

30 . 

An indication of the likelihood of the subject having a respective condition can then be 
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detennined statistically based on tfae number of individuals having the respective 
condition. 

Multi-Lcvd Analysis 

In the example described above, a number of conditions have been defined for the 
respective type of individual. However, it will be appreciated that sometimes it is 
desirable to perform tests to focus on specific conditions. ITias for example, in the case in 
which a -horse has an existing condition, it is sometimes desirable- to monitor the 
development of the condition for the respecUve subject. 



In this case, as the condition has been determined, it will not usually be necessary to 
consider all of the parameter values each time the analysis is performed. In particular, as a 
large number of panunetecs are provided to allow the different conditions to be 
distinguished, a large number of parameters will typically not be representative of the 
IS progress of a specific condition. 

Thus, it is usually possible to identify a number of key parameieni that are relevant to 
respective conditions. Thus for example, conditions relating to respective respiratory 
ilhiesses may be uniquely identified using a smaller number such as 50 parameters, m this 
20 instance, if the user is only interested in examining for the progress of this respective 
condition, the user can simply supply an indication of the values for die respective 50 
parameters. 

In this example, the processing system 10 would operate to compare the determined 
25 parameter values against parameter values of horses sufforing from the condition and 
horses not suffering from the condition. In this situation, the manner in which the 
collection of the parameter values is performed may very. 

In the examples described above it has been mentioned that the parameter values may 
30 includeforexampleexpressiondataconectedusinganarray.forcxamplc. If the arrays are 
to coUect values corre^nding to 5000 parameters it is typical for an array to be provided 
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vdth 5000 features thereon with each feature corresponding to a respective parameter 
Alternatively, tO.OOO features may be pn,vided with two features corresponding to each 
paramcler. fe any event, a person skflled in the art wiU appredate that a number of 
variations on this aie possible. 

However, if only 50 panoncters are to be measured, it is then possible to provide an array 
having 5,000 features with 100 features being used to determine the value for each 
parameter. This aUows the parameter values to be determined fir more accurately 
allowing a more accurate representation of the condition to be detennined. In particular 
moie accuiate comparison of the subiect data with the predetennined data can be 
performed. 



Thus, a typical sequence of events may be for a user to submit a general test having a large 
number of parameters similar to that described above which allows lespective conditions 
15 to be fi«t identified. Once a condition has been identified, the user «n then purchase 
speafically designed array plates adapted to monitor the spedfic condition. Measurements 
of the parameter values relevant to the condition can then be made fer more accurately 
aUowing the progress of the condition to be monitored in detaiL TTus can allow users to be 
provided with information concerning whether conditiomj are improving or not 

20 

Longitudinal Analysis 

In the methods described above, the subject data for a respective subject is compared to 
piedetcnmned data for a number of different individuals. However, in addition to or 
^tematively to this, longitudinal analysis am also be perfonned. In this instance. 'the 
25 subject data is compared to subject data previously collected for the same subject Thus. 

th« allows the progression of a condition within a subiect to be monitored. 

Again, it WiU be appreciated that if this is performed with a Umited mimbcr of panimeteis 
as described in the multi-lcvel analysis described above, then this allows an accumte 
30 assessmentoftheprogrcssionofaconditiontobemade. 
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By storing the results determined for a req>ective subject in the database 11^ for a 
predetermined time period, this can allow the progression of the disease over a time period 
to be monitored and displayed to the user. Thus^ the most recently obtained subject data is 
compared to earlier subject data for the same subject (and optionally predetermined ata), to 
5 determine disease progression. 

Thus, for example, levels of respective parameter values can be used to indicate the 
severity of the disease. This can be aducyed by comparmg the subject data to 
predetermined data in the manner described above, or alternatively using other tedmiques. 
10 As the parameter values vary over time, this can be used to provide an indication of 
whether the condition is improving or worsem'rig. This is turn can be used to monitor the 
effecdveness of any treatment given to the subject* 

Thus for example, if it is determined that a horse has been overtrained then the obvious 
1 5 solution to this problem is to reduce training for a predetermined time period, or resting the 
horse. However, trainers witt generally not want to reduce the training too much as the 
horse will become unfit. Similarly, worse problems can arise if the trainer resumes 
training too early. 

20 Thus, in this case, the trainer can submit subject data on a periodic basis such as every 
week allowing the fitness of the horse to be determined on a weekly basis. An indication 
of this can then be transferred back to the user allowing the trainer to determine when 
training of the horse should resume, or how hard training should be. 

25 This therefore allows the severity of the condition within the subject to be monitored. 
Secure Arrays 

Arrays, or genechip technology determines whether genes are turned "on** or **ofP* under 
certain conditions. A single gene is a stretch of DNA found in the nucleus of a ceH that 
30 encodes, through mRNA, the information required to produce a particular protein. 
Genechips measure the level of mRNA in ceUs and can be direcdy correlated to the 
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amouni of protein being produced in a celL 

In the example of collect subjea data for horses, a horse array and a blood sample are 
needed. The array has DNA dotted onto its surface (DNA of the genes in horse blood 
5 cells). The DNA on the array consists of one strand of the double-stranded DNA molecule 
- the other strand is provided by the blood sample and is labelled with a dy^ 

Two strands of similar DNA wiD only bind to each another (hybridise) if they match in 
sequence. An array reader can determine the amount of mRNA in a sample (gene turned 
10 "on** or ^off) by determining the amount of dyclabelled DNA that hybridises to an array- 

The reader produces a value compared to a reference for every single gene on the array. 
The 5,000 to 10,000 values can then be compared to Genetraks' database. Genes turned 
"on** or "oflP*, individually or in patterns, can then be identified and correlated to the 
IS specific conditions of a racehorse^ 

Various conditions in racehorses will alter the metabob'sm of the white blood cells, which 
can then be detected using the genechip technology. For example, the gene for manganese 
superoxide dismutase (MnSOD) may be turned **ofr in respiratory inflammation. 

20 Similarly, IFNg, IL-4 may also be turned "ofT, and the genes for Grola, IL-8, TNF and 
MIF may be turned "on". This pattern of "gene expressioTi** can be correlated to a specific 
condition, such as respiratory inflammation caused by a virus. Patterns of gene expression 
change as a horse succumbs to or recovers from a viral infection. As the technology and 
database develops, predictions on the stage of infection or influence of treatments can be 

25 made. 

As described above, it is preferably to ensure that the predetermined data is retained as 
confidential. 

30 However, if arrays arc used in the collection of data, it will be appreciated that it would be 
possible to purchase a quantity of arrays and perform data mining of data obtained from 
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At step 650 the army supplier will operate to generate arrays in accordafice with the 
randomised array layouts and the serial numbers. In particular, each genmted array wOl 
have features positioned thereon in accordance with a respective one of the randomised 
5 layouts^ together with an indication of the corresponding serial numben 

It is typical for the anay supplier to produce the arrays in batches with up to 1,000 arrays 
in each batch, with each batch being created in accordance wldi a difference randomised 
layout. 

10 

At step 660 the randomised arrays arc transferred to the users for subsequent use in 
generating the subject data, whilst at step 570 the serial numbers, together with 
corresponding layouts are transferred to the base station 1. 

15 The use of randomised arrays will slightly complicate the production process but will 
vastly increase the security of the arrays. In particular, third parties will be unable to 
utilise the arrays, as the location of features alter, wfaidbi will cause the third parties to 
obtain varying results on different arrays, £6r the same sample. 

20 Operation of the system to use the randomised arrays will now be described with reference 
to Figure 10. In particular, at step 700 the user will obtain a biological sample from the 
subject and then pertorm an assay process using the array at step 710, 

At step 720 the user uses the end station 3 to encode the values obtained from the array as 
25 subject data, together with a serial number indication. The subject data is then transferred 
to the base station 1, in the maimer described above, at step 730. 

The processing system 10 operates to determine the serial number from the subject data at 
step 740. The serial number is then used to access the respective array layout stored in the 
30 database 1 1 at step 750. 
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The array layout will then be used by the processing system to inteipret the subject data, 
and in particular, to determine the rcsptcdve feature to which each value corre^onds. 
This allows the processing system 10 to hence determine the parameter values for the 
respective subject data. 

5 

Operation of the invention will then be substantially as previoudy described above. 

It will be ^predated that the serial number may also be used to check the user is an 
authorised user. In particular, if each user is provided with arrays having a respecdve 
10 serial number (a range of serial numbcis), then having the army supplier provide an 
indication of the user and the serial nuniber(s) to the operator, this allows the operator to 
verify the identity of the user. This provides an audit tndl for the arrays. 

Feedback 

IS A further way in which the present invention may be utilised is to provide feedback on the 
accuracy of provided results. 

In particular, if the base station 1 is used to provide an indication of one or suspected 
conditions in a subject, the user can be requested to provide an indication whether the 
20 diagnosis provided by the base station 1 is correct. This may form a requirement, such that 
a user will only be provided with services by the base station if they agree to this term. 

In any event, the correctness of the assessment by the base station 1 can usually be 
determined by either treating the subject and determining if the tieatment is successful, or 
25 by monitoring the development of the condition over a predetermined time period. Once it 
has been determined that the diagnosis is correct or incorrect, an indication of this can be 
transferred to the base station 1. 

At this point, the rcspcc^vc subject data collected for the respective subject ^n be saved 
30 as predetermined data in the database 11, with the confirmation of the condition beijoig used 
as the indication of the condition in the predetermined data. 
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In Older to achieve this, the processing system 10 is typically coupled to a sample database 
that is used to store the subject data obtained from each subject Once confinnation of the 
conditions is received the subject data and the condition indication is transferred to the 
S predetermined data stored in the database 11. 

It will be appreciated that this checking of the conditions is not essential to the present 
invention as typically the data alone wiU be useful. However, checking of the condition 
will be useful in determim'ng the accuracy of the templates. 

10 

It will be appreciated that as further data is collected over through the feedback technique 
or through the use of alternative data collection methods the templates or other data can be 
updated allowing more accurate condition analysis to be performed. 

15 Users 

It will be appreciated that any individual may use the system. Initially at least however, it 
is necessary for the user to be able to generate the subject data. In the case in which arrays 
are used, for example, diis requires the user to first collect biological material, such as 
blood, and then analyse the material using the array. This is gcneraUy difficult and 
20 requires skiUed operators using existing technology. Accordingly^ the user may have to be 
a skilled technician. However, it is envisaged that collection techniques wiU become 
simpler, allowing the process to be implemented by any user. 

In the case of sporting or radng events, for example, the users could include: 
25 • Athletes; 

• Trainers; 

• Drug testing committees (such as Olympic QfiScials); 

• Medical practitioners (such as vets or doctors); 

• Event organisers; 

30 • Pathology labs (that would typicaUy perform the work on behalf of an individual, 
such as a horse owner). 
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However, this is not intended to be limiting, 
Reports 



10 



15 



20 



25 



30 



5 in general, the indication of any conditions suffered by the user, together with infonnation 
concemiAg the ability of the subject to compete in events, or the like, is provided in the 
fbrmof a report 

It wiU be appreciated by those skiDed in the art, that the content of the report may need to 
be tailored depending on the type of user. Thus, fbr example, a trainer will not be 
inteiested in knowii« about parameter values Ibr their horse, but wfll mther want to know 
what conditions the hoisc has, and the severity. In contrast, if the user is a skilled medical 
practitioner, then there may be some benefit in having more detailed information provided 
thereon. 

Accordingly, the processing system 10 can be adapted to generate tailored reports in 
accordance with report templates stored in the database 1 1, or the memoiy 21. Li this case, 
the processing system 10 wiU determine the type of user, and then access a respective 
report template. The template will specify the type of information to be provided to the 
user, allowing the processing system 10 to populate the report in acooidance with the 
results of the above described analysis. 

nius, for example, in the case of the user being a trainer, the processing system 10 can 
access a user report template, which will include a mimber of fields. 7bc processing 
system will determine ftom the fidd the infonnation wquired, and populate the fields 
accordingly. This may require some additional processing to place the information in the 
required form. The infonnatum wffl also be directed to a level the user can understand, 
and WiU therefore typically avoid the use of technical teims (such as medical tenns) for 
non-technical users. 

Thus, the processing system may be adapted to determine the condition and severity. This 
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is then used to access a look-up table, which indicates how serious the condition is to the 
subject Thus, the LUT may indicate that the condition is serious and medical condition 
should be obtained In this case, the report may therefore indicate merely that the subject 
has a condition and medical attention should be obtained. It wiU be realised that the advice 
may depend on phcnotypic data. Thus, a young horse may be more or less likely to require 
medical treatment for a given condition that an older horse. 



For skilled medical practitioners however, more detail may be required, in which case,the 
processing system may be adapted to imlicate not only the condition and severity, but also 
10 provide an indication of various important parameter values (such as red or white blood 
cell counts), to allow the medical practitioner to detennine what action to take. 

It will be appreciated that the in&nnation displayed may depend not only on the user, but 
aiso the respective condition. Birthermore, the mformation could be displayed graphically 
15 or as numerical or tactual information. 

As the completion of the report template is automated, it will be appreciated that users may 
be allowed to submit their own report templates, in accordance with predetermined criteria. 
aUowing the user to have reports generated in their desired format. 

20 

Finally, the processing system 10 can be adapted to piwide other advi<*. This can include 
for example, recommendations for changes in feeding habits, or the like. In general 
medical advice would not be given due to the issue of liability. However, it wiU be 
appreciated that the operator of the base station 1 could provide a medicaUy trained 
25 individual to provide medical advice if required. 

Architecture 

A range of different architectures may be implemented in addition to those described 
above. Whilst these wiU not be described in detail, it will be appreciated that any form of 
30 architecture suitable for implementing the invention may be used. However, one 
beneficial technique is the use of distributed architectures. In particular, a number of base 
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stations 1 may be provided at lespeaive geographical locations. This can incxeasc the 
effidciwqr of the system by reducing data bandwidth costs and requirements, as well as 
ensuring that if one base station becomes congested or a fault occurs, other base stations 1 
could take over. This also allows load sbaiii^g or the like, to ensure access to the system is 
S available at all times. 

In this case, it would be necessary to ensure that each database 11 contains the same 
information and templates such that the use of different ones of the base stations 1 would 
be transparent to the user. 

10 

Subject Data 

The subject data may be selected from any expression product of the genome or 
characteristic or set of characteristics of the subject whose levels or abundance may vary 
within the subject or between two or more different subjects depending on their status. The 

15 data include, but are not restricted to, biological, physiological and pathological data of the 
subject Examples of biological data include, transcrqitDmic profiles, proteomic profiles, 
enzyme fiuiction, receptor function, and the like. Physiological data may be selected fiom 
age, sex. height, length, weight, ethnicity, race» breed of animal, feeding patterns, ncercise 
patterns, medication supplied, nutritional or growth sujn>Iements suppUed, hair, skin and 

20 eye colour, fat composition, obesity, blood type, tissue type, endocrine function, 
immunological function, gastrointestinal fiinction, neurolt^'cal fiinction, kidney funcaon, 
heart function, brain iimction, pancreatic fiinction, bone function, joint function, 
prosthesis, tissue reconstruction, surgery, pain and the like. Examples of pathological data 
■ndude infectious disease inchiding viral infiectioo, bacterial infection, mycobactetium 

25 infection, parasitic infection, prion function, cancer, transplant rejection, inflammatory 
diseases such arthritis and fibrosis, toxicological profiles, substance abuse induding drug 
dependency and the like. 



Data Mining 
30 Data Mining 
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15 



The system uses a self learning classification sysiem, in which diagnosis is made using a 
iustorical database of test results (the predetermined data), which is updated as each test 
sample (subject data) is recorded. The historical database is typicaUy maintained on a 



server 



Chnical application of the system can be used to diagnose a subject such as an animal witl. 
an unknown clinical or peiformaiioc state. That is. the animal may or may not have some 
disease, or may or may not be race-rcady. A metabolic profile is measured for the-animal 
subject 

In a preferred example, the metabolic profile is comprised of expression signatures 
measured on an oligonucleotide chip. In a preferred exan,ple the metabohc profile is 
co«p«ed with a set of pre-compated diagnostic signatures (templates), and together these 
aie used to predict the health status of the subject In a preferred example, predicaon will 
include probabilistic estimates of uncertainty, and be accompanied by a list of possible 
difiTereotiai diagnoses. 



Diagnostic signatures are computed by data mining a historical database, which contains 
metabolic profile data on subject animals (predetermined data), and associated dinical 
20 information on subject health and perJbrmance status. These historical data use the same 
metabolic profile measurement technique as is used in clinical appUcation. In a preferred 
example, these metabolic profiles are comprised of expression signatures measured on an 
oligonucleotide chip. 

25 I^ataminingmaybeperformedusinganumberoftechniqucsincluding: 

• R«8^rised discriminant analysis for high dimensional data, as described by 
Knveri (1992) Canonical vaiiaie amUysis of high dimensional spectral data, 
Technometrics 34 pp. 321-331. 

. Diagonal discriminant analysis as described by S. Dudoit, J. Fridlyand, and T. P. 
Speed (2002). Comparison of discrimimttion methods fbr the classification of 
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tumois usiQg gene expression data. Journal of the American Staiistical AssociaUon, 
97 (457), pp. 77— 87. 

• Support Vector Machines as described by M P. S. Brown. W Noble Grundy, D Lin, 
N Cristxanini, C Sugnet, T S. Furey, M Ares, Jr., D Haussler (2000) Knowledge- 
5 based analysis of microanay gene expression data by using support vcaor 

machines. Proceedings of the National Academy of Sdencc. 97(l):2<S2-267. and Y. 
Lee, Y. Lin, and G. Wahba (2002) Multicategoty Support Vector Machines, 
Theory, and AppUcarion to the Oassification of Microanay Data and Satellite 
Radiance Data. Technical Report 1064. Department of Statistics, University of 
10 Wisconsin-Madison. 

« Bayesian kernel fitting algorithms. 

• T^ee based recursive partitiom-ng Breiman. L., Friedman, J., Olshen, R., and Stone, 
C. 0.9M) Oassication. augmented by Bagging Breiman, L. (1996) Bagging 
predictions, Machine Learning 26(2) pp. 123-140 and Boosting Breiman, L.(1998) 
15 Arcing classifiers. Annals of Statistics 26(3) pp, 801-849 

It will be apparent, to practitioners skilled in the art. that other data mining procedures may 
be used to rephce those identified above, without materially changing the nature of the 
invention. 



20 



25 



Diagnostic signatures are combined with test subject metabolic profiles to produce a 
diagnosis. In one example, (where data mimng was based on legulaiised or diagonal 
discriminant analysisX prediction is based on a Bayes classification rule, and estimates of 
uncertainty are based on posterior probabilities of class membenihip. 



In another example, (where data mining is based on Support Vector machines), 
classificau'on is based on the support vectors, and uncertainties are estimated bom distance 
of the test profile to the decision boundary. In another example (where data mining is 
based on recursive partitioning) classification is based on the estimated decision tree, or 
30 averaged over multiple decision trees. 
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It will usually be the case that even for an aoimal with an unknown clinical condition or 
performance status, some clinical or performance conditions are known. For example, ii 
may not be known whether or not the animal has disease A. but it is known that the animal 
has disease B and does not have disease C When test samples are recorded, the historical 
database is updated to indode the test sample, and any known concomitant clinical or 
performaTice infonnation. 



It will usually be the case that an animal is tested moic than once during a period of 
investigation. Re-tesiing may occur at a time when an earlier unknown dinical condition 
10 has become known. For the example given above, it may be the case that at a time of re- 
testing for lace-readiness it is known that during the initial test the animal did have disease 
A. Provision is made to aUow updates to and modification of the clinical data obtained for 
eadi test subject, as diagnosis is confirmed or modified. 

15 In one example, data mining is repeated at legular mtervals as the historic database grows. 
Test records added to the historic database wiU frequemly contain only partial clinical or 
performance data. Ftor any given dimcal or perfbnnance fector. data will be filtered to 
remove subjects for whidi the particular diaiactcristic is unrecorded. The data mimng 
algorithm will then be used to construct new diagnostic signatures for the given cbmcal or 
20 performance diaracteristic lite procedure of filtering and mining is repeated tor ead, 
characteristic of interest. In this way. the sample sizes used to obtain diagnostic signatures 
are constantly increasing, and predictive performance impr««res. TLe system becomes self- 
leaming. 

25 The overall process is illustrated by Pigu« lU whidi shows the flow of information and 
processing in the sdf-learning diagnostic system. 

It is apparent that the Historical database must be initialised, and preliminary data mimng 
conducted before dinical application of the diagnostic system. Hie database wiU be 
30 mmalised using a training set comprising data from animals with known metabolic 
conditions. Appropriate experimental design is vital to the construction of the initial 
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trainiiig data set* Empirical predictors derived using data xnining are suscq>tible to 
arte&ctual relatioaships, involving nuisance factors - such as regional differences in diet 
and husbandry. For tins reason, the training data set must be obtained from a multicentre 
trial, and stratified appropriately. 

5 

Specific Examples 

Specific examples are set out in more detail in Appendix A. These are for the puTXK)se of 
demonstration only and are not considered to be limiting* 

10 Persons sJdUed in the art will appreciate that numerous variations and modifications will 
become apparent. All such variations and modifications which become ^parent to 
persons skilled in the an, should be considered to faiU within the spirit and scope that the 
invention broadly appearing before described. 

IS Thus, for example^ the above description has focussed on the testing of a general subject. 
It win be appreciated that this is most advantageously used for performance animals to 
identify conditions that may lead to a decrease in performance. This allows trainers to 
identify problems with horses or other animals before they would be noticeable using 
existing techniques. This is of particular benefit in the horse racing industiy as it allows 

20 problems to be identified in advance» winch can in turn allow the condidons to be 
corrected before they effect the horses performance, which in turn can result in a vast loss 
of earnings for the trainers and owners of the horse. 

However^ the technique may also be apjdicd to any subjects^ including humans. 

. It wiU be appreciated that different predetermined data will be required for each type of 
subject being assessed. 
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Figure 12 is a flow diagram iUustrating one specific example of an infoxmation technology 
architecture and data flow as part of a remote delivery service process* External users are 
5 shown as Class One 505, Qass Two 510, and Qass Three 515 ihat are interested in 
obtaining information regarding their respective gene expression results when using the 
proprietary gene expression analysis service. These users may include, for example, 
pathology laboratories, drug laboratories, pharmaceutical companies, collaborators, 
medical and/or veterinary practitioners or similar, owners of performance animals, athletes 
10 and/or athletic tramcrs. Each of these users 505, 510, 515 will be interested in different 
aspects of the gene expression results and will therefore interact in a different &shion, but 
all will interact remotely via an user inter&ce module 520. 

Interfece 520 may, for example, be a browser-based inter&ce as found on most computers 
15 and delivered via web pages on the world-wide-web (the Internet). The initial interaction 
to the user int^ce module 520 will be via a controlled firewall and web server. The 
firewall will be the first line of defence against unwanted and unauthorised intrusion. Port 
blocking techniques and protocol restrictions will be imposed at the firewalL The firewall 
and web server environment will be fully maintained with the latest security patches to 
20 ensure currency of protection against hackers and intrusion. Each user will establish a 
secure connection 525 (user authentication and establish secure web connection) to rasure 
confidential identification in both directions for the user and service delivery provider. 
The security is managed by a customer access management system 565 that controls access 
of users 505, 510, 515. Such security measures are conmionly used in the art and one 
25 embodiment would be use of SSL (secure socket layer) technology and digital signatures. 
Further security layers can be added at Uiis interface if required and might include 
challengeAesponse component such as continuously changing numerical keys in 
possession of the user and available in plastic card format and trusted networks. 

30 Class One and Two Users 505, 510 are shown sendir^ information as a query 530 and 531, 
that includes a question regarding health or condition status of an animal OAl^^pretation 
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request), sample details, gene expression results, dinical information, pathology laboratory 
results, gene identities, gene sequences, collaboraiive requests, etc Qass Three Users 515 
are shown sending information 535 as a query including interrogation requests regarding a 
health status of individual animaWathletes or groups of individual animals/athletes. 

Queries 530 and 531 may contain formatted gene expression and clinical infonnation as a 
request, one such embodiment would employ the use of digitally signed XML documents 
to ensure authenticity and content of the request. Other authentication, authorization and 
encryption and Iccy management standards wiU be applied as they become available. 



As a further security measure to protect central databases 590, ftom outside unauthorised 
access, queries are temporarily stored in a transaction staging module 540 and queries 532 
and 533 will be drawn into respective pathology service module 550 and collaborative 
services modules 555 only on request firom the service module. This process may employ 
15 a s«»nd firewall and may be configured to further restrict network tr^^^ Tliis firewall 
WiU only permit internal requests from 550 555 560 to pass through the firewall. AU other 
network traffic will be blocked as win unnecessaiy ports and protocols. Respective 
pathology services module 550 and collaborative services module 555 include special 
software capable of servicing requirements of tbz di&rent types of users 505, 510. 
20 Pathology services module 550 and collaborative services module 555 are shown in 
communication with each other. Core central databases 590 store genetic information 
(genetic database) 591, sample and gene expression infonnation (sample database) 593, 
and correlarive data (correlative database & heuristics) 595. The genetic information 
stored in genetic database 591 is used to create gene expression devices Design detaUs 
25 592are also stored in the sample database which contains gene location information on the 
device and are used to interpret results firom such a device. 

The genetic database 591 is also used to provide gene identification and gene sequence 
information to collaborative services module 555 and collaborative services 575 (eg. 

30 interpretations, gene lists and gene sequences) to Class Two users 510. Information in t 
sample database 593 can be clustered together based on similarity using computer 
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15 



20 



algorithms such as K-means, principal oomponent analysis (PCA) aod sdtorganising 
maps, coimnonly available in packages provided by companies such as spotfire. silicon 
genetics, and at higher levels of interpretation, Omniviz, These dusters amount to 
identified correlations 594 between gene expression and sample infonnation and are stored 
in various fonnals, in the coirelative database 595. An heuristic or neural network or rule- 
based computer software system pre-programmed with rules or training sets takes queries 
534 (eg. expression details and sample details), stores these details in the sample database 
593 and then compares the query pattern to those already stored in the correlative database 
595 and produces standardized rq>orts and correlation details 570 (according to the rules of 
the heuristic program). Correlation details are converted to useful information such as 
gene expression conflation results, for example a fuUy formatted report to include 
interpretations 571 and interpretations 575 (and optionally genes lists and gene sequences) 
and are securely delivered back to the requestor via the internet to Class One and Two 
users 505, 510. 



Financials database 597 kc^ track of details including for example accounting, 
purchasing and payroll details. Sales and marketing database 596 keeps track of items 
such as sales and marketing detail^ cliciit details, customer relations management and 
stock management. Internal data warehouse 560 receives information from databases 590, 
596 and 597. This internal data warehouse 560 will only be accessed by authorised 
internal users conductii^ legitimate business activities. A secure (iotcmal) data warehouse 
545 services the needs of Oass Three users 515. Specific (and confidential) information 

580 is extracted from internal dau warehouse 560 that is then stored in secure customer 
data warehouse 545 where authorized users 515 can query 535 (for example as 
25 interrogation requests), specific and confidential information such as clinical history 
information, paflwlogy results and interpretations. This information is presented in a 
secure uaer-friendly andA>r visual format 585 in relation to individuals or groups of athletes 
or performance animals, and/or time series of results. 



30 Figure 13 is a flow diagram of one specific example showing steps for assessing a 
biological sample for diagnosing or assessing a condition of an am'maL A user collects a 
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biological sample 1010, for example a Wood sample from a horse. At the same time, 
biological parameters including biochemical and haematological parameters, clinical data 
(mcludiiig blood profile tests) and appraisal information axe collected and recorded in a 
standard format 1015. for example by filling in a standard form. 'IHe biological sample 
5 1010 is processed so that nucleic acids contained therein are detectable when hybridised 
with a complcmcntaiy (or mismatch-complementary) nucleic acid located on an array 
1020. The nucleic add may be detectable by a label incorpomted therein, for example a 
target nucleic add. P«sfcrably. the array 1020 is a device such as a microarray whid, is 
read 1030 by standard methods and equipmem common to the art to identify and measure 
10 relative abundance or absolute abundance of those nucleic adds from the biological 
sample whid, have bound to probe mideCc adds immobilised as part of anay 1020 
(indusion of a reference sample run in parallel allows for the calculation of the xdativo 
abundance of target nuddc add^ whereas a method developed by the company 
A^metrix, Inc (the "Affymctrix system") as described at their website "afiymetrix.com" 
15 relies on internal rcfeTences), 

Array 1Q20 may comprise a large number of probe nuddc adds, eg. 1000>5 of nucldc 
acids. A large mimber of probe mideic adds may be particularly usefid if an animal is not 
presenting with any visible signs of poor condition, eg. overt disease. Accordingly, in one 
20 embodiment, labdled torget nudeic adds of a sample are first applied to an array 
comprising a -fuU-screen" of target nudeic adds (eg. l,000's of nudeic add probes that 
represent most or many of the nudeic adds «q>ressed in a sample). Based on results from 
the full-screening, the labdled nuddc add targets may be applied to a sub-set of the full- 
screen, eg. a sdecu«i panel of middc add targets that may be assodated with a particular 
25 condition, for exanq>lc.ieq,iiaioiy diseases, drug consumptioI^ etc. 

Data from the read mi«oairay 1030 and dinical data and appraisal information 1015 is 
formaued 1040 and transmitted via a commum-cations network 1050, for example the 
Internet, to a remote diagnostic server 1060. It wiU be apprcdated that transmission of the 
30 formatted data to the remote diagnostic server 1060 requires less bandwidth than 
ttansmitting database information to the user and less skiU and time on behalf of the user 
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The transmitted data is analysed 1070, for example by comparison to a database of 
previously collected information in relation to clinical information and expression levels 
(felauve ab«ndance) of the nucleic aads applied to the microanay 1020. Also, experts for 
example, bioinformalicists, biologists, doctors, pathologists, and the like may analyse' the 
5 data to provide additional useful information. The analysis enables correlation to a 
condition 80. In this manner, the expression levels (relative or absolute abundance) of the 
nucleic add probes applied to the microarray 1020 arc correlated with previously coUected 
data relating to known conditions stored in a database 1080 and compiled. J 090 The 
database may also store information in relation to an identity of known nucleic acid^ 
10 nucleotide sequence on flie array and/or location of nucleic acids on tiie array its 
biological function and links to other databases. 

Results in relation to health and performance condition are transmitted via a 
communications network 1050 and may also be piovided to the user as a report 1095, for 
15 sample a hardcopy printout or visually on a computer mom'tor. 

The described system has advantages of requiring low bandwidth for transmitting sample 
data and final report between user and remote database/pmcessor, data processing is 
centralised and more efficient, expert analysis of the sample data is centralised, the 
20 computer software may incoipo«te heuristic methods thereby minimising human- 
interaction, the possibility of user and interpretation bias is avoided, and infonnation stored 
m ti.e commerciaUy valuable database is under strict control and docs not require direct 
access by an outside user. TTie steps are described in more detail hereinafter. 

25 ^8««W shows an enviromnent for working the method described in Kgm* 13 A user 

1100. which may be a veterinarian or practitioner, collects a sample 1120 6om an amnaal 

1101. for example a blood sample ftom a horse or athlete. Concurrently, information in 
re^tion to a condition of titc animal is collected in a standard format 1102. The sample is 
~nected, nucleic adds isolated therefiom. prepared and appUed to an a^^ U20andtbe 

JO array rs read by an array reader 1130. Data ftom the array reader 1130 and clinical 
appraisal and condition infonnation 1102 is entered into a computer ami fonnatted by a 
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processor 1140. which may be for example, a laptop computer with a modem. The 
formatted data is transmitted via a commimications network 1150. for example the 
Internet. A remote diagnostic server 1160 receives the transmitted data and the data is 
compared with a database($) 1161 which stores data, for example, data in relation to 
5 nucleic add location on an array, expression level (relative abundance or absolute 
abundance) of a nucleic add hybridised with a ootiespondtng nudeic add on an array, and 
data correlating nudeic add expression levd and performance, health, or condition of an 
animal. 

10 Hgure 15 is a flow diagram illustrating steps for preparing an array. A biological sample 
1210 is coUected from an am'maL Biological sample 1210 may comprise for example, a 
blood sample (preferably white blood cells isolated therefrom), urine sample or tissue 
sample (induding fetal tissues and tissues in various stages of devdopment). A spedfic 
aim of collecting the biological sample is to isolate and sequence as many rdevant genes 

15 from the sample for use on an array. Thousands of nudeic adds may be isolated that may 
form a large number of probes for a broad screening of an am'mal's genetic make-up or 
gene ocpression pattern. 



Nudeic adds arc isolated from the biological sample. In one instance the sample may be 
20 used to prepare genomic DNA or tissue spedfic mRNA 1223. In another instance RNA is 
isolated from the biological sample 1210 and a cDNA library 1220 is prepared from the 
isolated RNA. Plasmids 1221 comprising cDNA inserts from library 1220 may be 
sequenced 1222 from dther or both 5' and^or 3' end of the nucldc add. Prefewbly. 
scquendng is from the 3' end. Sequences may comprise Expressed Sequence Tags (EST). 
25 If an isolated naddc add does not encode a fiiU-lcngth gene (eg. an EST), a partial nudeic 
add may be used as a probe to isolate a flill-Iengdi nucleic add. Alternatively, or in 
addition, EST sequence information may be compared directly with a sequence database 
1230, for example OenBank. and a search for related or identical sequences performed. 
Putative gene idimtification and fimction 1231 may be determined from a search, for 
example a BLAST search performed in step 1230. By determimng the number of Umes 



30 
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cach gen^ is represented in tbe library, a computer may be progi^mmed to enable the 
normalisation and standardisation of the relative abundance data of mRNAs in a sample. 

Gene-specific oKgonudcotides 1232 may be synthesised using information from EST or 
5 foU-nucIeotide sequence 1222 data. Gene-spedfic oligonucleotides 1232 may be used as 
amplificab'on primes to amplify (step 1224) a region of a corresponding nucleic add. The 
nudeic add «;ed as template to amplify a region of conesponding nudeic add may be, for 
example, isolated plasmid DNA 1221 a«d/or genomic DNA. cDNA or mRNA (eg used 
with RT-PCR) 1223. TTie nuddc add thus prepared can be used directly as the nudeic 
10 acids for attaddng to an array 1240. Amplification produds 1225 may also be generated 
using non-gene-spedfic primers (eg. oUgondT. plasmid sequence flanking a nucleic add of 
interest). Oligonudeotides corresponding to a gene 1232 may also be used on array 1240 
alternatively the oligonudeotide oonrcsponding to known sequence can be buili 
successively nudeotide by nudeotide on a support using A^etrix methodology sud» as 
15 that in US patent no. 5,831,070, incoiporated herein by reference. 

In one embodiment, the step rdatiflg to constructing d>NA 1220 and isolating plasmids 
1221 comprismg the cDNA may be omitted. In this embodiment, isolated genomic DNA 
or tissue spedfic mRNA 1223 is used as a template to make amplification product 1225 by 

20 amphfication using gene^-fic prime« 1232. Amphfication product 1225 may be 
attached to array 1240, 

Nucleic adds attad^ to or buitt onto array 1240 preferably represent most, more 
preferably all, expressed genes in a given tissue from an animal of interest For example 
for a complete diagnostic test for racehorse blood, the array should contain genes 
expressed in the cdte of blood under various conditions and at various stages of cell 
differentiatioxL 



Figure 16 shows a flow diagram comprising steps for determining gene expression in 
30 biological samples comprising both reference target 1305 and sample target 1310. Nuddc 
adds, m particular RNA (total RNA or mRNAX are isolated from biological samples 1305 



14-11-02;22 : 52 JDAVIES COLLISON CAVE Pat. 8iTrad 



:61 7 3368 2262 



# 59/110 



10 



15 



-54- 

aad 1310, Which may be the same sample. cDNA is prepared from the RNA and the 
cDNA is labelled lesulting in labelled targets 1320 and 1325. Alternatively, or in addition, 
CDNA may be used as a template to synthesise labeUed antisensc RNA for use as targets 
1320 and 1325. Reference target 1325 may be provided as a p„.viously prepared labelled 
target of known concentration. Accordingly, reference target 1325 need not be synthesised 
in parallel with each sample target. Internal controls for reference target 1325 and sample 
target 1320 provide a means for normalising and scaling relative probe concentrations. 

Sample target 1320 and reference target 1325 are hybridised with array 1330 in step 1340 
Array 1330 may. for example, have been prepared by steps shown in Figure 15 The 
hybndised anay is washed 1345 to remove non-specific hybridisation of targets 1320 and 
1325. It will be appreciated that one skilled in the an could select different stringency 
conditions of wash 1345 as required. Array 1330 is read in an array reader 1350 to 
determmc relative abundance of RNA in the original sample, which correlates with 
expression of the corresponding gene in the biological sample. 



20 



Figure 17 is a flow diagram illnstrating steps for building a database. Biological samples 
UIO are coUected from animals having specific known condition(s). Preferably a 
statistically relevant number of biological samples 1410 are collected from a variety of 
normal animals to estabUsh a normal reference range of nucleic acid abundance levels 
ms should account for natural variation, including that associated with state of fitness 
sex. age. season, breed and diurnal cha«ges. Nucleic adds are isolated and labelled 1415 
from sample 1410. thereby forming respective target nucleic acids. Tkc labeUed target 
nucleic acids 1415 are applied to array 1420. which may be prepared as described in 
25 Figure 15. n,e array is read 1430 and data formatted 1440 into an electronic form for 
^cample a digital signal, smtable for transmission via a communications network K50 
Chmcal information from dinical appraisal, in illation to conditions of animals of interest' 
IS measured, documented and compfled 1460. n« clinical information is preferably 
coUccted in a standard format, ami for example, variable states such as tte level of fitness 
30 or body score (fatness) may be assigned given a value or number (for example between 1 - 
10). Spedfic clinical conditions may be graded (for example between 1-10) and assigned a 
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unique and standard identifier. An example of such a system is cunenUy used in clinical 
medicine and veterinary science and termed SNOMED or SNOVET (Standardised 
Nomendalurc of Mcdidne or Veterimiy Science), where a dinical condition can be 
described using a numerical system. This system has not been used for describing the 
5 normal condition or the ability of a performance animal to perform to its best A numerical 
grading system could also be used to standardise the collection of such data, for example, 
Ume spent on a treadmill is a strong indicator of exerdsc tolerance, as is blood 
concentration of oxygen and ability to traosport oxygen. Conditions may include disease, 
response to dmgs. training, nutrition and environment The clinical information 14^0 is 
10 formatted into electronic form 1440. for example a digital signal, suitable for transmission 
via a communications network 1450. 

The process is repeated such that a collection of several anay readouts for particular 
conditions are made. A standard range (for example, a population median of 95%) of 
15 values for each of the represented genes and its rciaiivc abundance can be calculated. This 
reference range can then be used as a comparison to test sample results. 

Nudeic add expression information from a read army 1430 for a taigct sample is 
coirelatcd with previously measured conditions 1460 to provide informarion on nucleic 
add expression level (abundance or relative abundance) with any previously measured 
condition. This information is compiled at server 1470 and good data is stored and bad 
data rejected 1480. The compilation process indudes coUocUon of a laigc enough set of 
aixay readout information for a particular condition so that inferences can be drawn on 
gene expression profQes and conditions. Tbc compilation 1470 may also inchide use of 
25 sophisticated pattern recognition and organisational softwaic and algorithms (examples 
common to the art indude algorithms such as K means, Anova and Mann Whitney, Self 
Organisiiig Maps, prindpal component analysis, hieiardrical dustering - any one of which 
is avaflable as part of proprietaiy software padcages) sbdi that expression patterns that 
di^ to normal or expected condition can be identified. The compilation 1470 vdU 

30 Piefcr^Jy indude sophisticated methods of supervised dassilicationsudi as regularised 
discriminant analysis, diagonal discriminant analysis, support vector madiines, or 



20 
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recursive partitioning - .„y one of which is readily conducted using proprietary software 
packages. Concurrently, comprehensive clinical information 1460 for aninials may be 
collected and biological samples 1410 tested on arrays so that correlations can be made 
between any cKnical observation and anay data. In this manner a database is created 
comprisiog data on nucleic acid expi^on which may include data correlating any desired 
conditton. for example normal and specific abnormal co„dition(sX with nucleic add 
egression. mstorcddatal4.0 n^ybe accessed using s^^^^ 

10 Throughout this spedficaaon. unless the context requires otherwise, the words comprise 
comprises and comprising will be understood to imply the inclusion of a stated integer or 
group of integers but not the exclusion of any other integer or group of integers. 

m order that the techniques outlined above may be readily understood and put into 
15 particular preferred embodiments will now be 

iollowuig non-limiting exarr^les. 



STEPl 

Bioloeical S ample CoUactinn 
20 A Wolc^ical con„»«„g „^ ^ ^ ^^^^ 

wo^ be Who,. M<«, . vd„ Of , p.rf„,^„ „ ^ 

b otag.c^ «„pb n^y 1^^ . nuut ^„ assac for 

25 „ ^ u™g«. j,^ fcnu^ or a,on.cic fluid wKid. i- 

comprises CBS ci ,he i™™^ system ft.. ^ 

«.s pres... in blood whid, co,^ ..RNA include n««,rc, io,™,„re «d 
*^ng «e«^ lyn^oy^^ ^ 

o»cr.pb.8«. ^' »">«c«Bw«-s. .wearinesses of oon^loodorigi,., 
30 tmiesmvanouscondiriona. 
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Methods described herein may include use of the abovementioned ceU types. n,e 
biological sa«.ple is collected and prepared using various methods. For example, an easy 
method of collecting cells of the blood is by venipuncture. TT^e biological sample may be 
collected fiom a perfonnance animal, for example, a horse with suspected laminitis, a 
human athlete or camel with osteochondrosis, or a greyhound with subclinical cystitis. 

Blood sample 

Ten ml of Wood is drawn slowly (to prevent hemolysis) from the vein of an animal 
Ou^ar vdn i„ a hon« and camel, veins on the forearm/limb of humans and dogs) into a 
1:16 volume of 49. sodium citrate to prevent clotting and the sample is mixed and then 
placed on ice. The sample is centrifuged at 3000 RPM at 40C for 15 minutes and white 
Wood cells (WBQ (commonly caUed the «bu£fy coat^O are removed f.om the interfi.ce 
between plasma and led blood cells (J^BC) into a separate tube using a pipette. The WBCs 
are then treated with at least 20 volumes of 0.8% ammonium chloride solution to lyse any 
15 ^mutating RBC and re.«ntrifi,ged at 3000 RPM at 40C for 5 minutes. The peUetted 
^ are then washed in 0.9% sodium chloride, re-centrifuged, and kept on ice. ceU 
peUet IS then used directly in RNA extraction. 



10 



20 



25 



Non-blood biological fluid «.mpi> 

A fluid s^^ for example, sputum, urine, bronchial or nasal lavages, joint fluid, 
pcntoneal flmd or tbpmdc flmd, is centrifuged at 3000 RPM at 4 OC for 20 minutes to 
collect ceUs. Samples comprisit« large amom^ts of mucous are treated with a mucolytic 
ag^tsud^asdithiothrctolpriorto^ntrifugatiot. A ceU pellet is then washed in 0 9% 
sodium Chloride, ic<cntrift.ged and the ceU pellet is used direcUy in RNA extraction. 

Tissue biopsy 

A ^e biopsy is ftozen in dry ice or liquid nim^^^^ 

and pestle. THe frozen tissue is then used directly in RNA extraction. 



30 



STEP2 

RNA Isolarioa 
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Total RNA and|/or mRNA is isolated from a biological sample. Use of isolated mRNA 
rather than total RNA may provide results with less backgroiuui and improved signal. 

RNA is commonly isolated by skiUed peisons in the ait, and examples of some methods 
5 for isolating mRNA are described below. 

commercially available kits, for example, Qiagen RNA and Direct RNA extraction kits, 
and RNA extraction kits produced by Invittogen (formerly Ufe Technologies) and 
Amersham Pharmacia Biotech herein incorporated by reference, may be used by following 
the manufecturer's instructions. Key elements of these mRNA extraction protocols 
include use of an appropriate amoum of sample, protection of the sample from RNAse 
contamination, ehition of the sample fiom a column at 700C and quantitation and quality 
checking in an agarose 0.7% gel and using an OD 260/280 ratio. About 0.2 gm (wet 
weight) of pdleted white blood cells or tissue is requited for each mRNA extraction which 
15 will yield about l-2ng of mRNA Disposable gloves should be worn throughout the 
procedure, with frequent changes. Both the column and solution used for elution should be 
at700C. 

RNA quantification and assessment of RNA size and quality include standard gel 
20 electrophoresis methods of ninoing a small quantity of an RNA sample on an agarose gel 
with known standards, staming the gel with for example ethidium bromide to detect the 
sample and standards and comparing relative intensities and size of standard RNA and 
sample RNAs, comparison of the intensities of the tibosomal RNA bands. AltemaUvely, 
or in addition, RNA concentration in a solution may be determined by measuring 
25 absotbance at 260/280 nm in a spectrophotometer relative to known standards and 
calculated usiQg known fonnulas. 



cDNA Synthesis a nd Labfellfiif> 

RNA prepared as described above may be synthesised to cDNA and labelled resulting in a 
30 labelled probe using kits provided by suppliers such as Ameisham Pharmacia Biotech, 
Invitrogen. Stratagenc or NEN, herein incorporated by reference. For example, a typical 
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reaction may comprise: template RNA, an oligo-dT primer and/or gcne-spedfic primcis, 
revcise transcriptase eii^me» deoxyribonudeic triphosphates (dNTPX a suitable buffer, 
and a label incoiporated into at least one of the dNTPs. Such a reaction when combined 
with a method of amplifying the resultant cDNA is referied to as RT-PCR (reverse 
5 transcriptase-polymerase chain reaction). A q>ecific exan^Ie is provided below, but it 
should be noted that other methods of incorporation of label into DNA can be used and 
that such methods are under constant review and impiovement, for example some methods 
include the ioooiporation of amino-allyl dUTP- and subsequent coupling of N- 
hydroxysuccinate activated dye to increase the specific labelling of the DNA. 

10 To aoneal primers) to template RNA, mix 2ng of mRNA or 50-100 pg total RNA from 
respective test sample (Cy3) and reference sample (Cy5> in sqtamte tubes with 4iig of a 
regular or anchored <digo-dT primer or gene-specific primers in a total volume of 15 jil 
(usii^ purified water to make up the volume). (Regular oligo dT is 5'-TTT TTT TTT TTT 
TTT TTT TIT. andiored oligo dT is S'-TTT TTT TmTTTmTT TTV N-S"), (where 

15 V=A, C or G; and N=A, C, G or T), Heat mixture to 6SoC for 10 min and cool on ice. 
Add 15,0 Ml of reaction mixture to respective Cy3 and Cy5 reactions. 

The reaction mixture comprises of the following: 6.0 ul of 5X first-strand buffer, 3.0 ^1 of 
0.1M DTP, 0.6 ul of unlabeled dNTPs. 3.0 ul of Cy3 or Cy5 dUTP (1 mM, AmeishamX 

20 2.0 ul of Superscript n (Reverse transcriptase 200 U/jiL. life Technologies) made to 15 |il 
with pure water. UnlabcUed dNTPs arc souroed from a stock solution corisisting of 25mM 
dATP, 25 mM dCTP, 25 mM dOTP, 10 mM dTTP. 5X first-strand buffer consists of 250 
mM Tris-HCL (pH 83). 375niM KQ, 15mM Mga2). The mixture is incubated at 42oC 
for 1 hr. Add an additional 1 pi of reverse transcriptase to each sample. Incubate for an 

25 additional OJ-1 his. Degrade flie RNA and stop the reaction by adding 15pl of O.IN 
NaOH, 2mM EDTA and incubate at 65-70oC for 10 min. If starting with total RNA. 
degrade the RNA for 30 min instead of 10 min. Neutralize the reaction by adding 15^1 of 
O.IN HQ. Add 380|il of TE (lOmM Tris, ImM EDTA) to a Microoon YM-30 column 
(Millipore). 

30 
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Next add 60jil of Cy5 probe and 60^1 of Cy3 probe to the same microcon. Centrifuge the 
column for 7-8 min. at 14,000 x g. Remove flow-through and add 450 TE and 
centrifuge for 7-8 min. at 14,000 x g (washing step). Remove flow-through and add 450 nl 
IX TE. 20 of spedes-specific Cotl DNA (ZOugAil, Life Technologies for human - Qjtl 
5 DNA is genomic DNA that has been denatured and re-annealed such that the concentration 
of the DNA and die time of re-anneaUng multiplied equals 1. Methods for making Cotl 
DNA are common in the artX 20^8 polyA RNA (10 ugAil, Sigma, #P9403) and 20 jjg 
tRNA (10 Mg/ul, Ufe Tedmologies, #15401-011). Centrifuge 7-10 min. at 14,000 x g. 
The probe needs to be concentrated such that with the addition of other solutions required 
10 for hylMidisarion the volume is not excessive, or is suitable for use with a desired slide and 
cover slip size. Invert the microcon into a dean mbe and centrifoge briefly at 14,000 RPM 
to recover the probe. 

A nucleic acid may be labelled with one or more labelling moieties for detcaion of 
15 hybridised labelled nucleic acid (ie. probe) and target nucleic acid complexes. Labelling 
moieties may include compositions that can be detected by spectroscopic, photochemical, 
biochemicai, immunochemical, optical or chemical means. LabeUing moieties may 
includfc radioisotopes, such as 32P, 33P or 35S, chcmfluminescent compounds, labelled 
binding proteins, heavy metal atoms, spectroscopic markers, such as fluorescent markers 
20 and dyes, magnetic labels. Unked enzymes, and the like. Preferred fluorescent markers 
include Cy3 and Cy5, for example available from Amersham Pharmacia Biotech (as 
decribed above). 

cRNA synthesis and labelling 
15 The Affymetrix system uses RNA as substrate and generates biotin labelled cRNA through 
a series of reactions detailed in a protocol available from their website (affymctrix.com), 
incorporated herein by reference. The cRNA is fragmented prior to application onto the 
array. 



STEP 3 

Arrays 
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One feature is an array comprising nucleic acids representing expressed genes from cells 
found in blood of a perfoimance aninial, for example a horse, human, camel or dog. The 
mideic acids may be of any length, for example a polynucleotide or oligonucleotide as 
defined herein. 



10 



15 



Each nucleic add occupies a known location on an airay. A nucleic add target sample 
probe Is hybridised with the array of nudelc adds and an amount or relative abundance of 
target nuddc add hybridised to each probe in theaaay is determined. 

High-density arrays are usefiil for monitoring gene expression and presence of allelic 
markers wWdi may be assodated with disease. Fabrication and use of high density arrays 
in monitoring gene expression have been previously described, for example in WO 
97A0365. WO 92/10588 and VS Patent No. 5,677,195. all incoqKmited herdn by 
reference. In some embodiments, high-density oligonudcotide arrays ate synthesised 
using methods such as the Very Large Scale Immobilised Polymer Synthesis (VLSiPS) 
described in US F&tem No. 5.445,934, incorporated herein by reference. 



20 



Arrays for human are commcidally available fiom companies such as Incytc, Research 
Genetics, and Affymetrix. Uon Biosdence reccnUy announced forthcoming release of a 
dog microarray and have a done collection of dog cDNAs. These arrays typicaUy 
comprise between 2.000 and 10,000 genes and are spedes spedfic. None are available for 
the horse or camd. Some of these genes are in multiple copies on the array and have not 
been fnUy annotated or given a true gene identity. AdditionaDy, it is not known whether 
DNA on the array, when hybridised to a test sample, spedfically binds to a single gene. 
25 This latter instance results fiom splice variams of RNA transcripts in tissues such that one 
gene may encode multiple transcripts. 

Human and dog anays (when available) can be used in methods described herein. 
However, these anays are currenily non-spedfic and include genes that are not expressed 
30 in blood cdls of animals, and/or do not contain genes important in contfolling the fom:tion 
of blood cdls, and/or comain regions of genes that are not specific to blood cells. 
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Clones containing specific genes are available and can be purchased for homan (mouse and 
dog) for use on arrays (for example from the IMAGE consortium or lion Bioscience). 
However, it is not possible to obtain specific dones for use on a blood-spcciiic array 
5 without prior knowledge of what genes are expressed in Uood cells. The IMAGE 
consoitium also does not guarantee that the gene of interest is contained in the clone 
purchased. 

Array Construction 

10 Because of difficulties, problems and a likelihood of wasting fmandal resources to obtain 
a blood-specific DNA array, a method is provided herein which provides rapid and cost 
effccflve generation of species and Ussue-specific DNA arrays for assessing nucleic acid 
expression in a sample. Figure 14 shows steps bx oonstnicring an array in one 
embodiment. 

15 

Target Nucleic Acid Preparation 

Biological samples are coUected as described above. Samples comprising cells expressing 
as many genes of interest in relation to condition(s) of a performance animal arc collected. 
For example, a sample comprising a mixture of nucleated blood cells from performance 
20 animals with conditions such as, osteochondrosis, laminitis, tendon soreness, bursitis, 
abcesscs, inflammation, allergy, viral infection, parasite infection, asthma, etc. 

Approximately 5 ng of mRNA Is isolated from the biological sample (typically 1 gm wet 
weight) using mRNA isolation kits or the protocol described above. Concurrently, 5 pg of 

25 mRNA is isolated from umbilical cord blood, and/orcady stage foetus. Cells and tissues 
contained within these sources would.exprcss genes that may not be expressed in the cells 
extracted from blood in the above example. Isolation of cytopUtemic mRNA from cells is 
preferred.. This step involves rupturing the cells with a solution comprising dcteigent 
and/or chaotropic agent and salt such that cell nuclei and the nuclear membrane remain 

30 intatt. The cell nuclei are pelleted by centrifugadon and the supernatant is used for mRNA 
extraction. Protocols for this procedure are available as part of mRNA isolation kits (eg 
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availablc by Qiagen). These mRNAs may be used to constnict cDNA Ubraries. Kits for 
the construction of cDNA libraries are available from companies induding Stratagenc and 
Tnvitrogen (eg Uni-ZAP XR cDNA syntheas library construction kit #200450). The 
hT)rafy pirferably should be constructed such that the orientation of the cDNA in the vector 
is known, that the mRNA is primed using oligo dT, the vector is capable of receiving a 
nucleic add insert up to 10 kb and that purification of DNA suitable for ONA sequencing 
is possible and easy. By following the manufacturer's instructions and paying particular 
attention to the quality of mRNA used and the size fractionation of cDNA ^reater than 0.7 
kb), a quality library containing enough viruses (>lxl06) with insert sizes >0,7 kb can be 
generated. 

Plasmids generated from such a library can be DNA sequenced using protocols that are 
well established in the art and are available* for example^ from Applied Biosystems. 
Briefly, a mix of 0.5 of plasmid DNA, 3J2 pmol of a primer that hybridises to the vector 
15 DNA (eg Ml 3 -21, or M13 reverse primer), thennostable DNA polymerase, dNTP and 
labelled dNTP is subjected to a routine PGR procedure to generate fragments of DNA that 
can be separated by gel electrophoresis and using machineiy such as that available from 
Applied Biosystems (eg a 3700 DNA sequencer). Generated DNA sequence data 
(chromatogram) is assessed and quality scores and binning of similar sequences is done 
20 using a computer program package such as Phred/Phrap/Consed. The raw DNA sequence 
data can then be loaded into a database where comments (armotation) on the sequence can 
be made, such as quality score, bin, length of poly A sequence (should there be one), 
BLAST search results, highest homology in Geiibank, clone idcnuty, other entries in 
Genbank. 

25 

Subjective fectors influencing whether a nucleic add should be used on an array include 
quality and confidence of the DNA sequence, a Genbank homology score with identified, 
nucleic acids, evidence of a poly-A tail (indicative of a translated transcript), uniqueness of 
the 3' sequence data (compared to both Genbank and an in-house database of clone 
30 sequences). 



14-t t 
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Nudeic acid pnmers can be selected using a program such as Primer 3 available via the 
Internet <ww-gciiome.wi.zmt.eda/csi-bin/^rimei^runet3). The selected primers may be 
used for ampli^ing a nudeic add, for example by PGR, or directly applied to an array. 
Uniqueness of a nudeic add can be tested by performing addiu'onal BLAST searches on 
5 Oenbank and an in-house database. Primers are preferably designed such that mdttng 
temperatures are similar, and amplification products are of a similar nudeic add length. 
Primers for PCR are generally between 18 and 25 nudeotide bases long. Piimere for direct 
use on a microarray or device are preferably between 50 and 80 nudeotide bases long. 
Both the amplification product and the angle primer should Iqrbridise to DNA that 

10 uniquely idei^fics a gene transcript. %>ecific programs using various foimulas are 
available for calculating the melting temperature of various lengths of DNA (eg Primer 3). 
Alternatively* seleded DNA sequences can be provided to Affymctrix for production of a 
proprietary and custom array. The sequences graeiated in-house are provided to 
Aflymetrix in Fasta format along with details of whidi parts of the sequence to be used for 

15 the generation of a probe set (11 probes, eadi 25 nudeotide bases long) fbr eadi gene 
represented on the array. . 

Nucleotide sequences may be conq)arcd with an existing database fbr example Genbanl^ 
to determine a previously provided name, tissue expression, tinning of expression, 

20 biochemical pathway, duster membership, and possible function or cellular role of an 
expressed nudeic add. In aiklition, a nucleic add fragment may be used as a probe to 
isolate a full-length nuddc add which may encode a ^ne which is assodated with a 
particular disease or condition. Farther, identified nudeic adds may be used to isolate 
homologues thereof indusive <rf orthologues fiom other ^ledes. An identified nudeic 

25 add may also be cloned into a suitable expression vector to produce an repressed 
polypeptide in vitro, which may be used, ft» example as an antigoi in generating 
antibodies and for use on protein arrays. The antibodies may be used fbr developing 
specific diagnostic assays or therapies, for thice-dunensional protein stnidure such as X- 
ray crystallographic studies, or fbr therapeutic developmenL 
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An anay may comprise any number of different nucUac adds, but typically comprises 
greater than about 100. preferably greater than about 1,000, more preferably greater than 
about 5,000 diffeient nucleic adds. An array may comprise more than 1,000,000 different 
nudeic adds. Each nudeic add is preferably represented more than once for scanning 
iniemal comparison and control. Preferably, the nuddc adds are provided in smaU 
quantiHes and arc gene-specific and/or spedes-spedfic usually between 50 and 600 
nucleotides long, arranged on a solid si^ppoit 



The AftymctriJt system uses 11 probes per gene, each of 25 nudeotidca, that are built onto 
10 the anay using a photolithographic method (US Patent Nos. 6.309,831; 6,168,948; 
5,856,174; 5,599,695;. 5,831,070; 6,153.743; 6,239,273; 6.271,957; 6.329,143; 6,310,189 
and 6.346,413). The nudeic adds may be dotted onto the solid support or bound to 
njicrosphcPBS, or in solution. A typical array may have a sui^ce area of less than 1 cm2, 
for example a microanay. 



15 



20 



25 



30 



A nuddc add can be attached to a soUd support via chemical bonding. Furthermore, the 
nudeic add does not have to be directly bound to the solid support, but rather can be 
bound to the solid support through a linker group. The linker groups may be of sufiDdent 
length to provide exposure to the attached nudeic add. Linker groups may indude 
ethylene glycol oligomers, diamines, diadds and the like. Reacdve groups on the solid 
support surface may react with one of the terminal portions of the linker to bind the linker 
to die solid support. Another terminal portion of the linker is then functionalised for 
binding the nuddc add. A solid support may be any suitable rigid or semi-rigid support, 
induding chaiged nylon or nitrocellulose, cheraicaUy treated glass sb'des available fiom 
conq>anies such as NEN. Coming, S&S. arrays avaflable through Affymetrix, membranes, 
fflteis, chips, slides, wafers, fibei^ magnetic or nonmagnetic beads, gels, tubing, plates, 
polymers, micropartides and capillaries. Hie soUd support can have a variety of surface 
forms, such as wells, trenches;, pins, channels and pores, to which the nudeic adds are 
bound. Preferably, the solid support is optically transparent 
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The airay may be constructed using an **aiTaying macfaine^ manufactured by companies for 
example Molecular Dymunics, Genetic Microsystems^ Hitachi, Bioiobotics» Amersham, 
Coming. Alternatively, the anay may be manufiicturcd according to specific instructions 
provided by the user to A^metrix. Source materials for this machine include microtitre 
5 plates comprising nucleic adds representative of unique genes, or sequence information. 
An array element may comprise, for example, plasmid DNA comprising nucleic adds 
specific for a gene sequence, an amplified product using gene-spedfic or non-spedfic 

primers and template DNA or RNA, or a synthesised specific oligonucleotide or 

poJynudeotide. Array elements may be purified, for example, using Sephacryl-400 
10 (Amersham Pharmada Biotech, Fiscataway, NJ.X Qiagen PCR cleanup columns, or high 
performance liquid chromotography (for oligonudcotides). 

Purified array elements may be applied to a coated glass substrate using a procedure 
described in U.S. Pat. No. 5,807,522^ incotpoiated herein by reference. By other example, 
15 DNA for use on Corning amino-sitane coated slides (CMT-GAPSTM) is le-su^nded in 
3xSSC to a concentration of 0.15-0^ yxg/^H and then used direcQy in an arraying machine 
in 96 or 3 84- well plates. 

An example for preparing an array element i^ provided by the manganese superoxide 
20 dismutasc gene. A done comprising a nudcic add insert is prepared and isolated as 
described above. The done is sequenced to idratify the nudeotide sequence. A BLAST 
search using the identified nudeotide sequence is performed to determine homology of the 
cloned nucldc add with nudeic adds in a database, for example GenBank. Identification 
of nucleotide sequence homology with superoxide dismutase genes stored in the database 
25 provides a level of confidence that the done comprises at least in part a gene for 
superoxide dismutase for the horse. Unique primers can be designed to amplify a nudeic 
add using PCR and the done DNA, or genomic DNA from the same spcdes as a template. 
Purified amplification product can be direcOy attached to an array and thereby act as a 
target for a complementary labelled nucleic add probe in the test and reference samples. 
30 Altcmaiively, a unique sequence can be determined and an oliognudeotide manufectured 



14-n-02;22 : 52 ;OAVIES COLLISON CAVE P8t.8.Tr8d 



;61 7 3368 2262 



# 71/110 



-67. 



10 



and purified for direct use on an array, or the sequence information suppHed diiectly to 
Affymetrix for the construction of a custom array. 

The array may comprise negative and positive control samples (preferably as dupUcates or 
triplicates) such as nucleic aa'ds from species different ftom a sample being tested 
(negative controls) and various nucleic acids (representative of RNAs and both ends of 
RNA molecules) that are found in aU tissues as a constant and known quantity ^lositive 
controls). These controls are identified and used by the anay reader to provide data on true 
signal Oe. Specific hybridisation between probe and target) and noise (ie. Non-specific 
hybridisation between probe and taigei) and average intensity from multiple reads of 
several different locations for each nudeic acid attached to the anay. 



A test sample and a reference sample may be simultaneously assayed on the anay. The 
reference sample may comprise mRNA ftom multiple sources, such that most, preferably 
15 aU of the nudeic adds on the airay are represented in the test sample, and can be used by 
the anay reader as a non-zero standard and for comparison with an average of the read- 
outs from the test sample. A relative intensity for each gene on the array can be calculated. 

The relative abundance of expression of each gene in a sample can also be calcuhued using 
20 controls within the anay. such as certain genes expressed in a Ussue at a consuint level 
under all conditions. 



Altcniatively, using the Affymetrix system, an absolute levd of expression is calculated 
based on the difEerence between the perfect matdi and nusmatch hybridisation for each of 
25 the U probes for each gene. Using such a process a gene is scored as present or absent 
and an abscdute measure of intensity is given along with a p value. 

The interpreted array may highlight only a few genes that are subsiantiaUy diderent in 
expression between a test and reference sample. AltemauVely, the overall pattern of 
30 expression may provide a '^geiprinf to characterise the way in which the original cells 
have responded to a particular condition of a performance animal. For example, the gene 
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for superoxide dismutase may be the only gene iq)-regulated in a paiticular condition, 
especially in conditions of infLammation, or a larg» number of genes may be up- and 
down- regulated in various conditions. It is tliis fingeiprint, laflier than spc&&: knowledge 
of gene sequence or function that can be used as a marker for various conditions. It would 
be expected that flQgeiprints be useful across species barriers to include performance 
animals such as humans, hors^ dog and camel. 

Thfr anangcmentof nudeic adds on the array may be periodically changed -and these 
arrays are then assigned a particular batch code that corresponds to a specific array 
comprising a specific nucleic add arrangement. The abib'ty to change the aixangement of 
nucleic adds on the array and knowledge of the exact anangement may prevent other 
people from generatuig a database using the arrays described above. Using a batdi code 
also enables tracking of manufacturers of the arrays in regards to the number of arrays 
produced. The batch code further enables validation of a user of the communication 
15 network or "internet" diagnostic method and system. Batch code can also identify a 
particubir type of array used, should more disease^pedlSc arrays be designed and 
manufiactured. . 

An example of how an array may be prepared and analysed is described in Eisen and 
20 Brown (Methods in Enzymology, 1999. 303 179) and in US Patent No. 6,114,114, herein 
incoiporatcd by reference. Chapter 22 of Ausubel et aL supra also describes methods and 
apparatus for use with arrays and is hecdn incoiporated by reference. 

Control samples may be respectively labelled in paraHel with a test and reference sample. 

25 Quantitation controls within a sample may be used to assure that ampUfication and 
kbdling procedures do not change a true distiibution of nuddc add probes in a sample. 
For this puipose, a sample may indude or be '^iked'* with a known amount of a control 
nudeic add which spedfically hybridises with a control target nucldc add. After 
hybridisation and processing, a hybridisation signal obtained should reflect accurately 

30 amounts of control nuddc add added to the sample. For such purposes, a microanay may 
have internal control?, for example a nudeic add encodii^ a common gene expressed by 
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the performance animal v^rith known expression levels and a nucleic add encoding a gene 
from another species that is known not to hybridise to the test or reference sample. To 
improve sensitivity and spedfidty of the assay, blocking agents such as Cbt DNA from the 
tested spedes may also be used. 

5 

STEP4 

Hybridising Sample Nuddc Add Probes with an Array 

Nudcic add probes may be prepared as described above from a biological sample from a 
10 performance animal that has been assessed concurrently by physical inspection and/or 
blood tests or other method. Nuddc add targets from a statistically rdevant number of 
normal animals previously hybridised to arrays, and a reference range for each of the genes 
on the array is calculated and used as a normal reference range (for example a 95% 
population median). Results from a test sample from a test animal can be compared with 
15 the same genes as the normal reference to determine if the test sample fells within the 
normal reference range. Further, nudeic add targets may also be prepared from biological 
samples from apparentty normal animals, animals with overt disease, various progressive 
stages of disease, hitherto undiagnosed or unclassified conditions or stages of such 
conditions, animals treated with known amounts of drugs (l«Sal or otherwiseX ammate 
20 suspected of being treated with drugs (legal or othcrwiseX animals under specific exerdse 
regimes for the sake of peiformanoc, am*nials subjected to (intentional or not) various 
nutritional states and/or environmental conditions. Databases of information from the use 
of such samples and arrays are created sudi that* test samples can be compared. The 
database will then contain spedfic patterns of gene expression for particular conditions. 

25 

Prior to hybridisation, a nudeic add ptobc may be fri^mcnted. Fragmentation may 
improve hybridisatioD by minimising secondary structure and/or cross-hybridisation with 
another nucleic add probe in a sample or a nudeic add comprising non-complementary 
sequence. Fragmentation can be performed by mechanical or chemical means common in 
30 Uie art. 
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A labelled nadcic add target may hybridise with a complementaiy audeic add probe 
located on an array. Incubation conditions may be adjusted, for example incubation timc> 
temperature and ionic strength of buffer, so that hybridisation occurs with piedse 
complementary matches (high stringency conditions) or with various degrees of less 
5 complementarity (low or medium striQgenqr conditions). High stringency conditions may 
be used to reduce badcground or non-q^ific binding. Specific hybridisation solutions 
and hybridisation apparatus are available commerdally by, for example, Stratagene, 
-Clontech, Geneworks. 

10 Affymetrix have detailed a standard procedure for the hybridisation of probes with an array 
(as describe at their website, affymetrix.com, incorporated herein by reference), however, a 
typical method entails the following: 



Adjust probe volume (prepared as above) to a value indicated in the "Probe & TE" column 
15 bdow according to the size of the cover slip to be used and then add the appropriate 
volume of 20X$SC and 10% SDS. 



Cover Slip Size 
(mm) 


Total Hyb 
Volume (I) 


Probe & TB 
(0 


20xSSC(l) 


10% SDS (1) 


22x22 


15 


12 


2^5 


0.45 


22x40 


25 


20 


4:25 


0.75 


22x60 


35 


28 


5.95 


1.05 



2axSSC is 3-0 M Nad, 300 mM Kaatrate <pH 7.0). 

20 

Denature the probe by heating it for 2 min at lOOoQ and centrifuge at 14,000 RPM for 15- 
20 min. Place the Mitire piobc volume on the array under the appropriately sized glass 
cover slip. Hybridize at 65oC (temperatures may vary when using difGcient hybridisation 
solutions) for 14 to 18 hours in a custom slide chamber (fiw example a Coming CMT 
25 hybridisation chamber #2551). 
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Washigg the Array 

After hybridisation, the anay is washed to remove non-specific probe and dye 
hybridisation. Wash solutions generally comprise salt and detergent in water and are 
commercially available. The wash solutions aie ^plied to the array at a predetermined 
5 temperature and can be performed in a commercially available apparatus. Stringency 
conditions of the wash solution may vaiy, for example from low to high stringency as 
herein described Washing at higher stringency may reduce background or non-specific 
hybridisation. It is- understood that standardisation of this step is required to produce 
maximum signal to noise ratio by varying the concentration of salt used, whether detergent 
10 is present (SDS), the temperature of the wash solution and the time spent in the wash 
solution. 

A typical wash protocol consists of removing the slide tiorn a slide chamber, removing the 
cover slip and pladng the sKde into 0.19&SSC (recipe provided above) and 0.1% SDS at 
15 room temperature for 5 minutes. Transfer the slide to 0.1% SSC for 5 minutes and repeat. 
Dry the slide using centrifugation or a stream of air. Equipment is available to enable the 
handling of more than one slide at a time (for example, slide racks). 

STEPS 

20 Reading the Anay 

After removal of non-hybridised probe, a scanner or "array reader^ is used to determine the 
levels and patterns of fluorescence flx>m hybridised probes. The scanned images ate 
examined to determine degree of hybridisation and the relative abundance of each nucleic 
acid on the array. A test sample signal corresponds with relative abundance of an RNA 

25 transcript, or gene e3q>re$$ion, in a biological sample. Altemativdy. an Af fymetrix array is 
read and computer algorithms calculate the difference between hybridisation on perfect 
match and mismatch probes for each of the 11 probes sets for each gene. It then calculates 
a presence or absence, an absolute value for each gene and a p value for the absolute calL 

30 Array readers are available commercially from companies such as Axon and Molecular 
Dynamics and AfTymetrix. These machines typically use lasers, and may use lasers at 
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different frequencies to scan the array and to differentiate, for example, between a test 
sample (labelled with one dye) and the control or reference sample Oabelled with a 
different dye). For example, an array reader may generate spectral lines at 532 nm for 
excitation of Cy3, and 635 nm for excitation of Cy5- 

5 

A relative quantity of RNA may be calculated by the array reader and computer for 
respective nucleic acids on the array for respective samples based on an amount of dye 
detected, average of duplicate samples for respective genes and subtraction of background 
noise using controls. The reader is pre-progriunmed to perfbrm such calculations (using 

10 proprietary software supplied with the array reader, such as MAS 5.0 for the Af^etrix 
system and Gcnepix for the Axon Instrumrats reader) and with information on the location 
of each nucleic acid on the array such that each nucleic add is given a readout value. 
Controls or reference samples providing a readout for particular nucleic acids that falls 
within standard ranges ensures correct integrity of the array and hybridisation procedures. 

1 5 Programs typically generate digital data and format it for transmission 

STEP 6 

Querying and Transfer of Digital Data to a Central Database 

20 Generated data is transmitted via a communications network to a remote central database. 
A user having access to the gene expression data enters information in relation to a test 
sample into a standard diagnostic form such that it can be digitaKsed* The information will 
include clinical appraisal and blood profile results. The format of such information is 
standard globally such that details on clinical conditions may be based on numerical input 

25 and each field of entry can be digitalised For example, body temperature field could be 
number 0001, a recorded temperanurc within normal range would receive the number 0, 
0.5OC above what is considered to be the normal range for that species would receive a 
number 5, IOC above normal range would receive 10. Some examples of conditions that 
may be scored or rated in such a fashion are provided below. 

30 

a) Body temperature. 
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b) Integument: eyes, soies, abcesses, wounds, in$ectsy/^arasites» aHc^^^ 

c) Cardio/Respiratory: eyes, nasal discharge, rales, viral/bacterial infection, allergy, 
chronic obstructive pulmonary disease, cough/wbeeze, ciepitous sounds in the thorax, 
epistaxis, auscultation sounds, heart sounds, capillary refill, mucous membrane colour. 

5 d) Gastrointestinal: diarrhoea^ colic/stasis, parasites, sqipetite level, drenching time and 
dose. 

e) Reproductive: stageof pregnancy^ aboition, inflammation, discharges. 

f) MuscuJoslccletal: lameness, laminitis, bone or shin soreness^ muscle soreness or tying 
up, tendon or ligament affected, level of pain. X-ray data, scintigmphy data, CAT scan 

10 data« bursitis, bruising, cramping or *^ing up**. 

g) Blood test results: biochemistiy, immunology, serology (viral, bacteriological* 
hormone levels), cell counts^ cell morphology, pathologist inteipretadon. 

h) Other diagnostic test results: X-ray, biopsy, histopathology, CAT scan, MRI, 
bacteriology, virology. 

15 i) Other data: Season (date^ location, male or finale, vaccinatioD history, body score 
(fitness and fat), fitness levcL 

Alternatively, the endre system could be based on the aforementioned SNOMED system 
with appropriate modfflcations to encompass descriptions of ^ercise physiology and the 

20 normal animal. Alternatively, the entire system could rely on text or categorical data that 
can be appraised and scored by software such as Omniviz* Whatever ^tcm is used, if 
would be appredated that the aim is to adequately, systemattcaUy and in a standard manner 
describe the cuTxent condition of the animal to the best of currently available tedmologies 
and could include results from machinery such as X-ray, ultrasound, scintigraphy and 

25 blood analysis* 

The user also ensures that array results (that may fbr example be automatically collected 
from a reader), array specifications, data mining specifications, level of interpretation 
required and the clinical information are entered and correspond to the same animal and 
30 the same sample. The form is transmined electrom'cally to a central database and 
recognised as an individual acc^on or request by the database. The central database 
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recognises the user (using for example digital certificates), the user recognises the central 
database, the array batch code and gene array order are verified, and the user is allowed 
access (which may be automatic) and automatic processing of the request is perfonned if 
security and billing information are adequate. The processing involves spccdiic roim*ng of 
5 central data and specific user requested information is retrieved and resent automatically. 

The above steps may be automated so that a user need not be present to perform the tasks. 
In an automated specific example, gene expression data from an array reader may be 
transmitted via a communications network directly to a server which is connected to a 
1 0 central database. Additional information could be input by the user at a processor which is 
also Knked to the array reader. 

Automated Data Mining Using Sent Data (Heuristic Methods) 

A central database interprets the array specificaiions (eg, nudeic. add order on a 
15 microarrayX decodes the information transmitted, determines nucleic add expression level 
in a biological sample and compares the expression level and patterns of expression with 
known standards or reference range. Various levels of database interpretation may be 
applied to the data transmitted, depending on Oic user requirements. Clusters of genes may 
be t9>-regulated or down-regulated in certain conditions and the database makes automated 
20 correlations to qiedfic conditions by accessing various levels of database information. 

Mining software such as Metamine (Silicon Genetics), ArraySCOUT (Lion Biosdence) 
can be used in this instance, and more advanced data mining technologies could be used to 
identify patterns and nearest neighbour information in data (such as products from AnVil 

25 Informatics Inc and OmniViz Inc). Further, software capable of taking rule-based 
instructions (such as that described by Pacific Knowledge Systems Sydney Australia in 
their •*ripple down'' technology) and having the ability to self learn (heuristics and neural 
network systems) such as that described in Khan et al. Nature Medicine 7 (6) 673, 
incorporated herein by reference, could be used at this stage to limit the level of human 

30 interaction in determining a diagnosis. In this latter example, an arttfidal neural network 
is used, and samples are divided into training and validation sets to create trained 
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calibrated models. The calibrated models are then used to rank genes in diagnostic 
importance. 

Level$ of database may include: 
5 • Unique gene sequences (eg 3' and 5' EST sequence of genes) 

• Gene identity, homologous genes^ tissue expression^ keywords, function, cellular 
rolc» gene clusters, biochemical pathway, PubMed references 

• Primer sequences used to generate amplificadon products (eg two primer sequences 
used to uniquely amplify the gene for gamma interferon in a particular species) 

10 ♦ Microarray construction and format (eg coded information on array manufacture 
batch and identibScation of genes and position on the array) 

• Blood profile and clinical data associated with particular conditions (eg standard 
clinical information and IDEXX-machine generated blood prolile data) 

• Array data for normal and apparently normal status (eg 95% median range for 
1 S normal am'mals) 

• Array data for inducible disease and disease models 

• Array data for various overt diseases (eg joint inflammation) 

• Array data for stages of various overt diseases (eg pre-cUnical, clinical and 
recovery stages) 

20 • Array data for the influence of various classes of drugs» legal or otherwise, of 
known administration and dose^ or unknown administration or dose (eg- various 
steroids) 

• Array data for the response to known and various levels of drugs used as a therapy 
(eg various anti-inflammatory medication at spodac doses for a specific condition) 

25 • Array data for the response to exerdse and various tmining regimes 
« Array data for the response to nutrition and various feeding regimes 

• Array data for the response to the cnviroimu^nt so as to possibly determine 
influence of during various seasons, or allergens or feed types. 



30 Each successive level relies on at least one previous level of database to allow for 
interpretation. Tlie database may be built over time and more intensive searching of the 
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database may incur a greater cost^ As the database grows, changes may be made to the 
above methodology to increase the sensitivi> of the detection of variation in expression of 
condition-specific genes - this could include the use of condition-specific arrays or 
condition-specific primers. Condition-spedfic arrays can be manufactured by a company 
5 such as Affymetrix (under instructions) that would allow for increased sensitivity and 
specificity, much reduced size of arrays, decreased cost of production, and the ability to 
process multiple samples at once. The process of building the database is iterative, such 
that specific genes are correlated to specific conditions, and the detection of variations in 
these genes becomes more sensitive and specific through the use of various modifying 
10 processes through the procedure (eg. the use of gene-spedfic primers for the amplification 
and labelling of cDNA from RNA, and the selection of limited numbers of genes on a 
disease- or condition-specific array, detection of splice variants and single nucleotide 
polymorphisms). 

15 ST1EP7 
Standardised Electronic Reporting 

The database reports back electronically to a remote user, cither automatically or with a 
level of human intervention* The electronic report may be converted to a printed 
document. The r€|>ort provides details of an am'mal^s condition that is determined by 
20 correlation of gene expression data with infiDrmation stored in a remote database, and 
optionally expert analysis. 



Information sent might include: 

- Individual genes up-regulated or down-regulated (for example, with kuninitis or 
joint capsule inflammation or bursitis, a report on the up-rcguladon of genes such as 
interleukitt-3, manganese superoxide dismutase, Grooe, mctalloproteinase matix- 
metallo-dastase, ferritin light chain may have some correlation to tissue inflammation, 
and down-regulation of genes such as insulin-like growth factor and its receptor may 
be correlated to recoveiy from such a condition). The identity of these genes cannot be 
predicted to be assodated to any condition unless the above described methodology is 
used and databases on relative expression of genes for particular conditions have been 
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compiled^ Therefore a screeniDg test covering all genes may need to be peifbrmed fiist 
and a second^ more specific test then zpplxed. 

• The overall pattern of gene expression and any oorreladon to paiticular conditions. 
For example, animals io heavy training may have a gene "fingeiprint" that is different 

5 to animals being spelled from training. 

• Individual pattern of gene expression (ie* the shape of the gene expression pattern 
over a time course or multiple samples taken over a period may change as an ^ntmai 
recovers fiom a condition) . , 

• Changes to a pattern of gene expression* gene expression profile or level for a 
10 single am'mal over a time period or for successive tests. 

• Clusters of genes up-regulated or down-regulated in a particular condition 

• Pathways of genes up-regulated or down*regulated in a particular condition 

• Correlations between genes up-regulated or down-regulatcd and known conditions, 
or stage of condition, or influence 

15 • Known therapies to ameliorate the condition or enhance desired effects 

• Specialist pathologist written interpretation 

• Relevant information of use to veterinarians, medical practitioners, ownet^ traineis 
• and athletes 

• Collections of data on groups of animals under specific management regimes 

20 

Throughout the specification the aim has been to describe the preferred embodiments of 
the invention without limiting the invention to any one embodiment or specific collection 
of features. It would therefore be appreciated by those of skill in the art that, in Ught of the 
instant disclosure, various modifications and changes can be made in the particular 
25 embodiments exemplified without departing fiom the scope of the present invention. For 
example, the examples described hciein may be used with performance animals other than 
horse, for example human, dog ati^ camel. 



AU references, inclusive of patents, patent sq^ylications, scientific documents and computer 
30 programs, referred to in this specification are herein incorporated by reference in its 
entirety. 
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THE CLAIMS DEFINING THE INVENTION ARE AS FOLLOWS: 
1) A method of detennimng the status of a subject, the method xncIudiDg: 

a) Obtaining subject data, the subject data including one or more parameter values, at 
least one of the parameters being indicative of the current biological status of the 

5 subject; 

b) Comparing the subject data to predetermined data, the predetermined data 
including for each of a number of individuals: 

i) One or more values for the parameters for the respective individual, at least 
some of the individuals having a number of conditions relevant to the status of 

10 the individual, the number of parameters being statistically sufBcient to 

distinguish eadi of the conditions; and, 

ii) Antndicationof the status of the respective individual; and, 

c) Determining the status of the subject in accordance with the results of the 
comparison. 

IS 2) A method according to claim 1, the number of parameters being greater than 100. 

3) A method according to daim 1, the number of parameters being greats than 1000. 

4) A method according to daim 1, the number of parameters being less than 6000. 

5) A method according to daim 1, the method induding determining any conditions 
displayed by the user. 

20 6) A method according to daim S, the method induding determining the ability of the 
subject to p^orm in a sporting and/or ladng event in accordance with any determined 
conditions. 

7) A method according to daim 1, the method of performing the comparison induding 
causing the second processing system to: 

25 a) Obtain a set of templates, the set of templates representing differences between 
groups of individuals; and» 
b) Use the templates to classify the subject data into a re^ective one of the groiq>s. 

8) A method according to daim 7, the method induding detenniiiing one or more 
conditions in accordance with the determined groups. 

30 9) A method according to daim 1, the parameters being represoitative of the level or 
abundance of a molecule selected from one or more of : 
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a) A polynucleotide; 

b) A polypeptide; and, 

c) A polysaccharide. 

10) A method according to claim 1, the method including: 

S a) Receiving confilnnationofthe determined status; and, 

b) Updating the predetermined data in aocoitlance with the confirmed status and the 
subject data. 

11) A method according to claim 1, the predetermined data indudiiig phcnoiypic 
information of the individuals, and the subject data including phenotypic information 

10 regarding the subject, the phenotypic infonnation including details of one or more 
phenotypic traits. 

12) A method according to claim 11, the method including comparing the subject data to 
predetermined data for individuals having one or more phraoiypic traits in common 
with the subject 

IS 13) Apparatus for determining the status of a subject, the apparatus including a processing 
system adapted to: 

a) Obtain subject data, the subject data including one or more parameter values, at 
least one of the parameters being indicative of the current biological status of the 
subject; 

20 b) Compare the subject data to predetermined data, the predetennined data including 
for each of a number of individuals: 

i) One or more values for the parameters for the fespeciive individual, at least 
some of the individuals having a number of conditions relevant to the status of 
the individual, the number of parameters being statistically sufficient to 

25 distinguish each of the conditions; and, 

ii) An indication of the status of the respective individual; and, 

c) Deiermine the status of the subject in accordance with the results of the 
comparison, 

14) A computer program product for determining the status of a subject, the computer 
30 program product including computer executable code which when executed on a 
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siritable processing system causes the processing system to pexfoim the method of 
daiml. 

15) A method of aUowii^ a user to determine the status of a subject* the method including: 

a) Receiving subject data from the user via a communications network, the subject 
5 data including one or more paramet^ values, at least one of the parameter being 

indicative of the current biological status of the subject; 

b) Causing the base station to: 

i) Compare the subject data to predetermined data, the predetermined data 
including for eadi of a number of individuals: 

10 (1) One or more parameter values for the respective individual; and, 

(2) An indication of the status of each individual; and, 

ii) Determine the status of the subject in accordance with the results of the 
comparison; andy 

c) Transferring an indication of the status of the subject to the user via the 
IS communications network. 

16) A method according to claim IS, the method indtiding; 

a) Having the user determine the subject data using a remote end station; and, 

b) Transfeiring the subject data from the end station to the base station via the 
communications network. 

20 17) A method according to claim IS, the base station including first and second processing 
systems, the method including: 

a) Transferring the subject data to the first processing system; 

b) Transferring the subject data to the second processing system; and, 

c) Causing the second processing system to perform the .comparison. 
25 18) A method accordiog to claim 17, the method including: 

a) Transferring the results of the comparison to the first processing system; arKl, 

b) Causing the first processing system to determine the status of the subject. 
19) A method according to claim 17, the method including at least one of: 

a) Transferring the subject data between the communications network and the first 
30 processing system through a first firewall; and. 
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b) Ttansfcning the subject data between the first and the second processing systems 
through a second lirewalL 

20) A method according to claim 17, the second processing system being coupled to a 
database adapted to store the predetermined data, the method including: 

5 a) Querying the database to obtain at least selected {mdetermined data from the 
database; and, 

b) Compare the selected predetermined data to the subject data. 

21) A method -according to claim 17, the second processii^ system being coupled to a 
subject database, the method including storing the sul^ect data in the subject database. 

10 22) A method according to daim IS, the stams including details of any conditions of the 
individuals, the method including determining any conditions displayed by the user. 
23) A method according to daim 22, the method including determining the ability of the 
subject to perform in a sporting and/or racing event in accordance with any determined 
conditions. 

15 24) A method according to daim 22, the method of performing the comparison induding 
causing the second processing system to: 

a) Obtain a s^ of templates, the set of templates representing differences between 
groups of individuals; and, 

b) Use the templates to dassify the subject data into a respective one of the groups. 
20 25)A method according to daim 2A, the method including determining one or more 

conditions in accordance with the determined groups. 

26) A method according to daim IS, the parameters bciog representative of the level or 
abundance of a molecule selected from one or more of: 

a) A polynucleotide; 
25 b) A polypeptide; and, 

c) A polysaccharide. 

27) A method according to daim 15, the method induding having the user determine the 
subject data using a secure array, the secure array having a number of features each 
located at respective position on the array, and a respective serial number, the method 

30 including causing the base station to: 

a) Determine the serial number from the subject data; 
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b) Detennine a layout indicating the position of each feature on the array; 

c) DeteiminiAg the parameter values in accordance with the determined layout^ and 
the subject data. 

28) A method according to daim 15, the method including: 

S a) Receiving confirmation of the deteiToined status from Oie user; and, 

b) Updating the predetermined data tn ao^rdancc with the confirmed status and the 
subject data. 

29) A method according to daim 27i the features including at least one of: ■ . 

a) An ohgonudeotide; 
10 b) A peptide; and, 

c) An antibody. 

30) A method according to claim IS, the predetermined data induding pheaotypic 
information of the individuals and the subject data induding phenotypic information 
regarding the subject, the phenotypic information induding details of one or more 

IS phenotypic traits. 

31) A method according to daim 17, the method including comparing the subject data to 
predetermined data for individuals having one or more phenotypic traits in common 
with the siibject. 

32) A method according to claim IS, the method induding causing the base station to: 

20 a) Determine payment information, the payment information representing the 
provision of payment by the user; and, 

b) Perform the comparison in response to the determination of the payment 
infonnation. 

33) A base station for determining the sUtus of a subject, the base station induding: 

25 a) A store mediod for storing predetermined data, the predetermined data including 
for each of a number of individuals: 

i) One or more parameter values^ at least one of the parameters being indicative of 
the current biological status of the indrvidual; . 

ii) An indication of the status of the individual; and, 

30 b) A processing system, the processing system being adapted to: 
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i) Receive stibject data from the user via a ccmfnunicatioxis network, the subject 
data including one or more parameter values; 

ii) Compare the subject data to the predetermined data; 

iii) Determine the status of the subject in acooidance with the results of the 
5 comparison; and, 

iv) Output an indication of the status of the subject to the user via the 
communications network. 

34) A base station according to claim 33, the processing system being adapted to leceive 
subject data from a remote end station adapted to determine the subject data. 
10 35) A base station according to claim 33, the processing system including: 

a) A first processing system adapted to: 

i) Receive the subject data; and 

ii) Determine the status of the subject in accordance with the results of the 
comparison; and, 

15 b) A second processing system adapted to: 

i) Receive the subject data fiom the processing system; and, 

ii) Perform the comparison; and, 

iii) lYansfer the results to the first processing system* 

36) A base station according to daim 35, the base station including: 

20 a) A first firewall fi>r coupling the first processing system to the communications 
network; and, 

b) A second firewall for coupling the first and the second processing systems. 

37) A base station according to daim 35, the processing system being coupled a subfect 
database, the processing system adapted to store the subject data in the sut>|ect 

25 database* 

38) A method according to claim 35, the method of performing the comparison induding 
causing the second processing system to: 

a) Obtain a set of templates, the set of templates representing differences between 
groups of individuals; and, 
30 b) Use the templates to classify the subject data into a respective one of the groups. 
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39) A method according to daim 38, the method including detennimng one of more 
conditioDS in accordance with the determined groups. 

40) A base station according to claim 33, the subject data being determined using a secure 
array, the secure array having a number of features each located at respective position 

5 on the array, and a respective serial number, the processing system being adapted to: 

a) Determine the serial number from the subject data; 

b) Determine a layout indicating the position of each feature on the array; 

c) Determining the parameter values in accordance with the determined layout, and 
the subject data. 

10 41) A base station according to claim 33, the processing system being adapted to: 

a) Receive confirmation of the determined ability; and, 

b) Update the predetermined data in accordance with the determined ability and the 
subject data. 

42) A base station according to claim 33, the predetermined data including phenotypic 
IS infiormation of the individuals, and the subject data including phenotypic information 

regarding the subject, the phenotypic information including detafls of one or more 
phenotypic traits. 

43) A computer program produa for determining the status of a subject, the computer 
program produa indudi^g computer executable code which when executed on a 

20 suitable processing ^em causes the processing system to perform the method of 
daim 16. 

44) An end station adapted to determine the status of a sutiject, the end station Including a 
processor adapted to: 

a) Determine subject data from the user via a communications network, the subject 
25 data including one or more parameter values, at least one of the parameter being 

indicative of the current biological status of the subject; 

b) IVansfer the subject matter to a base station via a communications networic, the 
base station being adapted to: 

i) Compare the subject data to predetermined data for one or more individuals, the 
30 predetermined data including: 

(1) One or more parameter values for the respective individual; and, 
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(2) An indication of the status of each individual; and, 
ii) Detennine the status of the subject in accordance with the results of the 
compadson; and, 

c) Receive an indication of the status of the subject lo the user via the 
5 communications network. 

45) A computer program product fbr determining the status of a subject, the computer 
pipgram product including computer executable code which when executed on a 
suitable processing system causes the processing system to operate as an end station 
according to claim 44. 

10 46) A method of determining the ability of a subject to perform in a ^rting and/or racing 
event, the method indudii^: 

a) Obtaining subject data, the subject data induding one or more parameter values, at 
least one of the parameter being indicative of the current biological status of the 
subject; 

IS b) Comparing the subject data to predetermined data, the predetermined data 
including for each of a number of individuals: 

i) One or more parameter values fbr the respective individual; and, 

ii) An indication of the status of each individual; 

c) Dctennimng the status of the subject in accordance with the results of the 
20 comparison; and, 

d) Providing an indicadon of the ability in accordance with the results of the 
comparison. 

47) A method according to claim 46, the status of each individual indicating any conditions 
displayed by the user, the method including: 

25 a) Determining any conditions displayed by the user in accordance with the results of 
the comparison; and, 
b) Determining the ability in accordance with the determined conditions. 

48) Apparatus for determimng the ability of a subject to perform in a sporting and/or radng 
event, the apparatus including a processing system adq>ted to: 
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a) Obtain subject data, the subject data incloding one or more pazameter values, at 
least one of the parameter being indicative of the current biological status of the 
subject; 

b) Compare the subject data to piedetermined data, the predetennined data including 
for each of a number of individuals: 

i) One or more parameter values for the respective individual; and, 

ii) An indication of the status of each individual; 

c) Detezxnine the status of the subject in accordance with- the results of the 
comparison; and, 

d) Provide an indication of the ability in accordance with the results of the 
comparison. 

49) A computer program product for determining the ability of a subject to perform in a 
sporting and/or racing event, the computer program product including computer 
executable code which when executed on a suitable processing system causes the 
processing system to perform the method of claim 46. 

50) A method of providing secure arrays for use, each anay including a number of 
predetennined fieatures, the method including: 

a) Detcrmim'ng a number of respective feature layouts, each layout representing the 
positioning of each feature on a respective array; 

b) Determining a number of serial numbeis, each serial number corre^onding to a 
respective layout; 

c) Generating a number of arrays, each array being generated in accordance with a 
respective layout, and including the conesponding serial number thereon, the serial 
number being used in processing used the airay. 

51) A method according to daim 50, the method being performed to provide the arrays on 
behalf of an entity, the method including providing an indication of the layouts and 
corresponding serial numbers to the entity, to thereby allow the entity to process the 
arrays. 

52) A method according to claim 50, the method of determining the layouts including: 
a) Determining a preferred layout; and. 
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b) Moving the position of one or more of the features from the position in the 
preferred layout to alternative position. 
53) A method according to daim 52, the method including; 
a) Detennining the of each feature; and, 
5 b) Exchanging the position of one or more features having different feature types. 
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